| Literature DB >> 31569548 |
Abstract
The aim of the present study was to determine the lowest cut-off value for albuminuria levels, which can be used to detect diabetic kidney disease (DKD) using the urinary albumin-to-creatinine ratio (UACR). National Health and Nutrition Examination Survey (NHANES) data for US adults were used, and participants were classified as having diabetes or prediabetes based on a self-report and physiological measures. The study dataset comprised 942 diabetes and 524 prediabetes samples. This study clarified the significance of the lower albuminuria (UACR) levels, which can detect DKD, using an artificial intelligence-based rule extraction approach. The diagnostic rules (15 concrete rules) for both samples were extracted using a recursive-rule eXtraction (Re-RX) algorithm with continuous attributes (continuous Re-RX) to discriminate between prediabetes and diabetes datasets. Continuous Re-RX showed high test accuracy (77.56%) and a large area under the receiver operating characteristics curve (75%), which derived the two cut-off values (6.1 mg/g Cr and 71.00 mg/g Cr) for the lower albuminuria level in the UACR to detect early development of DKD. The early cut-off values for normoalbuminuria (NA) and microalbuminuria (MA) will be determined to help detect CKD and DKD, and to detect diabetes before MA develop and to prevent diabetic complications.Entities:
Keywords: artificial intelligence; cut-off; diabetic kidney disease; microalbuminuria; normoalbuminuria; rule extraction
Year: 2019 PMID: 31569548 PMCID: PMC6963949 DOI: 10.3390/diagnostics9040133
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Summary of the National Health and Nutrition Survey (NHANES) diabetes dataset and the definitions of diabetes and prediabetes.
| Attribute | Possible Values or Ranges | |
|---|---|---|
| Age (years) | 20–80+ | Continuous |
| Sex | Male, Female | Binary |
| Race/ethnicity |
Mexican-American Other Hispanic Non-Hispanic White Non-Hispanic Black Other race | Nominal |
| Systolic blood pressure (mmHg) | 62–228 | Continuous |
| Diastolic blood pressure (mmHg) | 0–118 | Continuous |
| Waist circumference (cm) | 40.2–177.9 | Continuous |
| Body mass index (kg/m2) | 12.1–82.9 | Continuous |
| Total cholesterol (mg/dL) | 69–813 | Continuous |
| Urine albumin-to-creatinine ratio (mg/g Cr) | 0.21–9600 | Continuous |
| Glycohemoglobin (%) | 3.5–17.5 | Continuous |
| Tobacco use | Every day; some days; not at all | Nominal |
| Alcohol (average no. alcoholic drinks/day) | 1–25 | Continuous |
| Exercise to lose weight | No, Yes | Binary |
| Triglycerides (mg/dL) | 13–4233 | Continuous |
| LDL-cholesterol (mg/dL) | 14–375 | Continuous |
| Direct HDL-cholesterol (mg/dL) | 8–138 | Continuous |
| Fasting plasma glucose (mg/dL) | 51–421 | Continuous |
| Insulin (μU/mL) | 0.14–682.48 | Continuous |
| Total bilirubin (mg/dL) | 0.1–7.1 | Continuous |
The following terms were defined: Diabetes, fasting plasma glucose (FPG) ≥ 126 mg/dL (7.0 mmol/L); fasting, no caloric intake for at least 8 h or hemoglobin A1c (HbA1c) ≥ 6.5% (48 mmol/mol); and prediabetes, FPG 100–125 mg/dL (5.6–6.9 mmol/L; impaired fasting glucose) or HbA1c 5.7–6.4% (39–47 mmol/mol). All tests were performed in a laboratory using the method of the National Glycohemoglobin Standardization Program certified and standardized to the Diabetes Control and Complications Trial assay. NHANES, National Health and Nutrition Examination Survey; LDL, low-density-lipoprotein; HDL, high-density-lipoprotein.
Comparison of participants with prediabetes and diabetes from the NHANES diabetes dataset.
| Attribute | Prediabetes (SD) | Diabetes (SD) | p-Value |
|---|---|---|---|
| Age (years) | 55.04 (15.79) | 60.01 (13.54) | <0.0001 |
| Sex | Male (1): 45.80% | Male (1): 50.74% | 0.0069 |
| Race/ethnicity | Mexican-American: 13.74% | Mexican-American: 21.54% | <0.0001 |
| Systolic blood pressure (mmHg) | 125.47 (18.10) | 130.57 (20.15) | <0.0001 |
| Diastolic blood pressure (mmHg) | 69.18 (13.14) | 68.15 (14.99) | 0.0086 |
| Waist circumference (cm) | 104.66 (15.75) | 107.73 (15.73) | <0.0001 |
| Body mass index (kg/m2) | 30.95 (7.05) | 31.74 (6.93) | 0.0018 |
| Total cholesterol (mg/dL) | 198.19 (40.73) | 183.31 (40.37) | <0.0001 |
| Urine albumin-to-creatinine ratio (mg/g Cr) | 8.85 | 14.85 | 0.337 |
| Glycohemoglobin (%) HbA1c | 5.92 (1.05) | 7.38 (1.81) | <0.0001 |
| Tobacco use | Every day: 14.88% | Every day: 14.33% | 0.88 |
| Alcohol (average # alcoholic drinks/day) median | 1 | 1 | 0.002 |
| Exercise to lose weight | Yes: 30.34% | Yes: 26.64% | 0.13 |
| Triglycerides (mg/dL) | 134.76 (68.22) | 145.92(71.38) | 0.0001 |
| LDL-cholesterol (mg/dL) | 118.79 (36.29) | 104.18 (34.82) | <0.0001 |
| Direct HDL-cholesterol (mg/dL) | 52.47 (14.88) | 49.93 (13.63) | <0.0001 |
| Fasting plasma | 108.0 | 139.8 | <0.0001 |
| Insulin (mU/mL) | 12.1 | 12.74 | 0.21 |
| Total bilirubin (mg/dL) | 0.730 (0.28) | 0.72 (0.28) | 0.20 |
Figure 1Schematic overview of the recursive-rule eXtraction algorithm with continuous attributes (continuous Re-RX). NN, neural network; C4.5, C4.5 decision tree.
Figure 2Detailed flow chart of the continuous Re-RX. NN, neural network; C4.5, C4.5 decision tree; D, discrete attributes (variables); C, continuous attributes (variables); BP, backpropagation; δ1, cover rate; δ2, error rate.
Accuracies after CV for the NHANES diabetes dataset.
| NHANES Diabetes Dataset | TR ACC (%) | TS ACC (%) | # Rules | AUC-ROC (%) |
|---|---|---|---|---|
| Continuous Re-RX [10 × 5 CV] | 79.65 ± 0.62 | 77.56 ± 2.19 | 15.82 | 75 |
CV, cross-validation; Re-RX, recursive-rule eXtraction; continuous Re-RX, Re-RX algorithm with continuous attributes; TR, training dataset; TS, testing dataset; ±, standard deviation; ACC, accuracy; AUC-ROC, area under the receiver operating characteristic curve; 10 × 5 CV, 10 runs of five-fold cross-validation.
Fifteen concrete rules that were used to discriminate between prediabetes and diabetes datasets using the artificial intelligence (AI)-based rule extraction approach.
| IF Part | Condition 1 | Condition 2 | Condition 3 | Condition 4 | Condition 5 | THEN |
|---|---|---|---|---|---|---|
| R1 | HbA1c ≤ 5.8 | FPG ≤ 122.7 | LDL ≤ 101.0 | Mexican-American = 0 | Class 2 (Prediabetes) | |
| R2 | HbA1c ≤ 5.8 | FPG ≤ 122.7 | LDL ≤ 101.0 | Mexican American = 1: (Yes) | AGE ≤ 40 | Class 2 (Prediabetes) |
| R3 | HbA1c ≤ 5.8 | FPG ≤ 122.7 | LDL ≤ 101.0 | Mexican-American =1: (Yes) | AGE > 40 | Class 1 (Diabetes) |
| R4 | HbA1 ∈ (5.8, 6.1) | FPG ≤ 122.7 | LDL ≤ 101.0 | Class 1 (Diabetes) | ||
| R5 | HbA1c ≤ 6.1 | FPG ≤ 122.7 | LDL > 101.0 | Class 1 (Diabetes) | ||
| R6 | HbA1c ≤ 5.6 | FPG > 122.7 | UACR ≤ 71.00 | Class 2 (Prediabetes) | ||
| R7 | HbA1c ≤ 5.6 | FPG > 122.7 | UACR > 71.00 | Class 1 (Diabetes) | ||
| R8 | HbA1c ∈ (5.6, 6.1) | FPG > 122.7 | LDL ≤ 151.0 | Class 1 (Diabetes) | ||
| R9 | HbA1c ∈ (5.6, 6.1) | FPG > 122.7 | LDL > 151.0 | Class 2 (Prediabetes) | ||
| R10 | HbA1c ∈ (6.1, 6.4) | LDL ≤ 142.0 | FPG ≤ 108.5 | Non-Hispanic Black = 0: (No) | Class 1 (Diabetes) | |
| R11 | HbA1c ∈ (6.1, 6.4) | LDL ≤ 142.0 | FPG ≤ 108.5 | Non-Hispanic Black = 1: (Yes) | UACR ≤ 6.1 | Class 2 (Prediabetes) |
| R12 | HbA1c ∈ (6.1, 6.4) | LDL ≤ 142.0 | FPG ≤ 108.5 | Non-Hispanic Black = 1: (Yes) | UACR > 6.1 | Class 1 (Diabetes) |
| R13 | HbA1c ∈ (6.1, 6.4) | LDL ≤ 142.0 | FPG > 108.5 | Class 1 (Diabetes) | ||
| R14 | HbA1c ∈ (6.1, 6.4) | LDL > 142.0 | Class 2 (Prediabetes) | |||
| R15 | HbA1c > 6.4 | Class 1 (Diabetes) |
FPG, fasting plasma glucose; LDL, low-density lipoprotein; UACR, urine albumin-to-creatinine ratio.
Comparison of the cost of a laboratory UACR test and a semi-quantitative UACR test in several countries.
| Japan | UK | The US | |
|---|---|---|---|
| Cost of laboratory test for UACR | JPY 1080 [ | ₤7.4 [ | $16 [ |
| Cost of a POC testing for a semi-quantitative UACR | JPY 230 [ | ₤2.31 [ | --- |
UACR: Urine albumin-to-creatinine ratio; POC: point of care.