| Literature DB >> 32498677 |
Guoxiang Xie1,2, Xiaoning Wang1,3, Runmin Wei4, Jingye Wang4, Aihua Zhao5, Tianlu Chen5, Yixing Wang3, Hua Zhang1,3, Zhun Xiao1,3, Xinzhu Liu1,3, Youping Deng4, Linda Wong4, Cynthia Rajani4, Sandi Kwee4, Hua Bian6, Xin Gao6, Ping Liu7,8,9, Wei Jia10,11,12.
Abstract
BACKGROUND: Accurate and noninvasive diagnosis and staging of liver fibrosis are essential for effective clinical management of chronic liver disease (CLD). We aimed to identify serum metabolite markers that reliably predict the stage of fibrosis in CLD patients.Entities:
Keywords: Amino acids; Bile acids; Chronic liver disease; Free fatty acids; Hepatitis B; Liver fibrosis; Metabolomics; Random forest
Mesh:
Substances:
Year: 2020 PMID: 32498677 PMCID: PMC7273661 DOI: 10.1186/s12916-020-01595-w
Source DB: PubMed Journal: BMC Med ISSN: 1741-7015 Impact factor: 8.775
Demographic and clinical data of patients with CLD and NC in cohort 1 (training set) and cohort 2 (validation set)
| Dataset | Cohort 1 training set | Cohort 2 validation set | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Group | Control | CLD | S0–2 | S3–4 | Fibrosis | Cirrhosis | Control | CLD | S0–2 | S3–4 | Fibrosis | Cirrhosis |
| 502 | 504 | 349 | 155 | 400 | 104 | 90 | 300 | 134 | 166 | 141 | 159 | |
| Sex (M/F) | 365/137 | 361/143 | 257/92 | 104/51 | 299/101 | 62/42 | 59/31 | 202/98 | 81/53 | 121/45* | 86/55 | 116/43* |
| Age (year) | 36.65 ± 11.73 | 36.58 ± 11.88 | 33.23 ± 9.95 | 44.88 ± 12.22*** | 34.05 ± 10.31 | 48.65 ± 11.51*** | 47.13 ± 9.95 | 47.96 ± 13.28 | 41.55 ± 12.83 | 53.14 ± 11.26*** | 41.21 ± 12.72 | 53.95 ± 10.67*** |
| BMI (kg/m2) | 23.08 ± 3.16 | 22.28 ± 3.18*** | 22.02 ± 3.26 | 22.91 ± 2.91** | 22.15 ± 3.22 | 22.87 ± 2.97* | 22.35 ± 1.83 | 23.18 ± 3.13* | 23.28 ± 2.48 | 23.1 ± 3.61 | 23.21 ± 2.49 | 23.15 ± 3.65 |
| APRI | 0.09 ± 0.04 | 0.79 ± 1.33*** | 0.6 ± 0.81 | 1.27 ± 2.08*** | 0.63 ± 0.92 | 1.55 ± 2.4** | 0.65 ± 0.7 | 0.61 ± 0.78 | 0.69 ± 0.57** | 0.6 ± 0.76 | 0.72 ± 0.58*** | |
| AST/ALT | 0.82 ± 0.32 | 0.69 ± 0.44*** | 0.6 ± 0.31 | 0.94 ± 0.59*** | 0.61 ± 0.33 | 1.1 ± 0.63*** | 1.09 ± 0.6 | 0.98 ± 0.57 | 1.24 ± 0.61*** | 0.97 ± 0.57 | 1.27 ± 0.61*** | |
| FIB-4 | 0.62 ± 0.33 | 2.86 ± 7.21*** | 1.42 ± 1.48 | 6.57 ± 12.71*** | 1.56 ± 1.7 | 9.42 ± 15.8*** | 3.64 ± 3.64 | 2.49 ± 2.58 | 5.26 ± 4.27*** | 2.45 ± 2.53 | 5.54 ± 4.3*** | |
| ALT (IU/L) | 30.97 ± 15.63 | 176.49 ± 199.81*** | 195.3 ± 213.41 | 128.51 ± 150.32*** | 193.08 ± 208.27 | 94.18 ± 121.97*** | 17.92 ± 7.79 | 81.57 ± 117.77*** | 111.4 ± 141.39 | 57.2 ± 87.35*** | 108.45 ± 138.58 | 57.43 ± 89.06*** |
| AST (IU/L) | 21.81 ± 6.84 | 93.21 ± 99.71*** | 95.65 ± 100.23 | 86.98 ± 98.47 | 96.32 ± 102.01 | 77.75 ± 86.32 | 20.25 ± 4.37 | 68.73 ± 73.78*** | 85.6 ± 91.48 | 54.95 ± 51.62* | 83.2 ± 89.82 | 55.74 ± 52.57 |
| TBIL (μmol/L) | 15.5 ± 4.84 | 27.73 ± 32.98*** | 21.75 ± 13.27 | 42.87 ± 55.62*** | 23.06 ± 20.64 | 50.58 ± 61.18*** | 13.98 ± 3.62 | 33.67 ± 47.77*** | 23.77 ± 38.4 | 41.77 ± 52.99*** | 23.54 ± 37.45 | 42.77 ± 53.94*** |
| ALP (IU/L) | 85.57 ± 19.18 | 89.72 ± 76.26 | 81.3 ± 65.24 | 111.05 ± 95.86** | 82.37 ± 61.46 | 125.77 ± 119.98** | 77.82 ± 19.21 | 93.68 ± 70.86* | 70.57 ± 55.75 | 112.56 ± 76.25*** | 70.97 ± 54.47 | 114.08 ± 77.53*** |
| GGT (IU/L) | 17.12 ± 10.72 | 69.18 ± 97.06*** | 61.13 ± 69.75 | 89.73 ± 143.43* | 66.72 ± 74.55 | 81.42 ± 169.46 | 26.26 ± 19.07 | 76.87 ± 82.16*** | 77.22 ± 83.58 | 76.59 ± 81.23 | 76.14 ± 81.77 | 77.53 ± 82.76 |
| TP (g/L) | 74.41 ± 4.76 | 73.47 ± 8.55* | 75.68 ± 5.31 | 67.87 ± 12.03*** | 75.42 ± 5.24 | 63.82 ± 13.76*** | 73.65 ± 3.19 | 71.31 ± 17.94*** | 72.89 ± 5.84 | 69.97 ± 23.74*** | 73.07 ± 5.82 | 69.66 ± 24.23*** |
| ALB (g/L) | 49.23 ± 2.77 | 40.19 ± 5.91*** | 42.14 ± 3.37 | 35.23 ± 7.81*** | 41.75 ± 3.54 | 32.47 ± 8.67*** | 44.92 ± 2.19 | 37.86 ± 7.18*** | 41.44 ± 4.79 | 34.93 ± 7.48*** | 41.39 ± 4.71 | 34.69 ± 7.54*** |
| TBA (μmol/L) | 4.67 ± 3.18 | 28.47 ± 45.11*** | 20.33 ± 37.28 | 49.68 ± 55.81*** | 21.3 ± 36.44 | 65.41 ± 63.96*** | 3.6 ± 2.63 | 43.41 ± 55.16*** | 25.05 ± 37.86 | 58.89 ± 62.37*** | 24.33 ± 37.04 | 61.11 ± 62.91*** |
| PLT (109/L) | 261.02 ± 65.25 | 164.52 ± 61.7*** | 184.2 ± 47.74 | 114.68 ± 65.05*** | 179.06 ± 49.86 | 93.23 ± 64.82*** | 132.44 ± 62.97 | 163.58 ± 51 | 106.03 ± 60.14*** | 161.54 ± 51.18 | 105.27 ± 60.9*** | |
| Collagen proportionate area | 7.46 ± 4.01 | 1.96 ± 1.43 | 9.95 ± 6.03*** | 2.71 ± 2.45 | 15.17 ± 7.11*** | |||||||
| HBV-DNA (log10) | 6.25 ± 2.42 | 6.32 ± 2.44 | 5.99 ± 2.34 | 6.33 ± 2.38 | 5.39 ± 2.66 | |||||||
| Negative HbeAg, | 191 | 115 | 76 | 142 | 49 | |||||||
| Negative HbeAb, | 223 | 153 | 70 | 175 | 48 | |||||||
| Negative HbsAg, | 29 | 24 | 5 | 26 | 3 | |||||||
Values are expressed as mean ± SD
ALT alanine transaminase, AST aspartate transaminase, TBIL total bilirubin, ALP alkaline phosphatase, GGT gamma-glutamyl transferase, ALB albumin, TBA total bile acid, PLT platelet
*p < 0.05, **p < 0.01, ***p < 0.001, by Student’s t test, CLD vs. NC, S3–4 vs. S0–2, cirrhosis vs. fibrosis
Fig. 2Workflow chart of feature selection. For a total of 98 metabolites (including AAs, BAs, and FFAs), univariate analyses (Wilcoxon’s rank-sum test) were employed for three clinical aims (aim 1: CLD vs. NC, aim 2: cirrhosis vs. fibrosis, aim 3: early fibrosis vs. advanced fibrosis). Twenty-six metabolites with p < 0.001 in all three clinical aims were selected and fed into LASSO and random forest algorithms for three aims. The overlap of top 5 LASSO non-zero coefficients and top 5 important variables from random forest (by mean decrease of accuracy) was selected. For aim 2 and aim 3, we selected the overlapped variables and combined with variables selected from aim 1 to yield the final panel four metabolites. “OR” means the union of two sets, and “AND” means the intersection of two or more sets
Fig. 1Study design. Serum metabolites were quantified in cohort 1 (504 biopsy-proven HBV-CLD patients and 502 NC) and were used to identify candidate markers. After data analysis and feature selection, four metabolites were selected to compose our marker panel. Different machine models and clinical indices were compared using 10-fold cross-validation. Three RF models were constructed to diagnose CLD from NC (model 1), differentiate fibrosis vs. cirrhosis (model 2), and grade early fibrosis vs. advanced fibrosis (model 3) in cohort 1. These three were further validated in the independent HBV cohort 2
Fig. 3Metabolite marker panel and model 1 for CLD with chronic HBV infection diagnosis. a Comparison of the four markers between CLD patients and NC in cohorts 1 and 2. b Waterfall plot of RF score and corresponding heatmap for the four markers in all datasets. c ROC curves of model 1 (RF model constructed with four markers), APRI, AST/ALT, and FIB-4 in cohort 1. d PR curves of model 1, APRI, AST/ALT, and FIB-4 in cohort 1. e ROC curves of model 1, APRI, AST/ALT, and FIB-4 in cohort 2 validation set. f PR curves of model 1, APRI, AST/ALT, and FIB-4 in cohort 2 validation set. g The diagnosis RF score in NC and CLD patients in training and validation sets. ***p < 0.001, Wilcoxon’s rank-sum test. The optimal cutoff value of the RF score was 0.434
Results for measurement of the metabolite marker panel, APRI, FIB-4, and AST/ALT ratio in the prediction of liver fibrosis
| Cohort 1 training set | Cohort 2 validation set | |||||
|---|---|---|---|---|---|---|
| CLD vs. controls | Fibrosis vs. cirrhosis | S0–2 vs. S3-4 | CLD vs. controls | Fibrosis vs. cirrhosis | S0–2 vs. S3-4 | |
| Metabolite marker panel | ||||||
| AUROC (95% CI)# | 0.997 (0.993–1) | 0.941 (0.914–0.964) | 0.918 (0.889–0.946) | 0.977 (0.963–0.988) | 0.844 (0.797–0.884) | 0.807 (0.756–0.852) |
| AUPR (95% CI) | 0.994 (0.986–1) | 0.87 (0.824–0.913) | 0.892 (0.854–0.925) | 0.993 (0.989–0.997) | 0.827 (0.761–0.884) | 0.817 (0.764–0.866) |
| Cutoff value (sensitivity (%)/specificity (%)/F1 (%))† | 0.434 (98.4/99/98.7) | 0.01 (87/90.4/78.4) | − 0.115 (86.7/90.5/84.6) | 0.434 (92.2/94.4/95.2) | 0.01 (71.8/81.6/73.3) | − 0.115 (72.9/76.1/71.8) |
| FIB-4 | ||||||
| AUROC (95% CI) | 0.848 (0.823–0.87) | 0.869 (0.829–0.906) | 0.802 (0.762–0.844) | 0.707 (0.652–0.762) | 0.758 (0.692–0.815) | 0.739 (0.68–0.798) |
| AUPR (95% CI) | 0.863 (0.84–0.883) | 0.725 (0.657–0.79) | 0.707 (0.651–0.761) | 0.897 (0.873–0.918) | 0.726 (0.657–0.794) | 0.726 (0.66–0.795) |
| Cutoff value 1 (sensitivity (%)/specificity (%)/F1 (%))* | 1.45 (68/73.6/62) | 1.45 (81.5/42.5/68.1) | ||||
| Cutoff value 2 (sensitivity (%)/specificity (%)/F1 (%))* | 3.25 (44.2/93.4/56.3) | 3.25 (57/81.3/65.5) | ||||
| APRI | ||||||
| AUROC (95% CI) | 0.973 (0.965–0.981) | 0.698 (0.644–0.752) | 0.647 (0.595–0.698) | 0.879 (0.841–0.915) | 0.608 (0.534–0.671) | 0.595 (0.529–0.669) |
| AUPR (95% CI) | 0.977 (0.969–0.983) | 0.416 (0.345–0.497) | 0.492 (0.434–0.554) | 0.958 (0.942–0.972) | 0.53 (0.474–0.605) | 0.542 (0.488–0.614) |
| Cutoff value 1 (sensitivity (%)/specificity (%)/F1 (%))** | 1 (33.9/86.2/37.1) | 1 (22.7/84.4/32.4) | ||||
| Cutoff value 2 (sensitivity (%)/specificity (%)/F1 (%))** | 2 (18.3/94.6/26.6) | 2 (3.9/94.3/8.5) | ||||
| AST/ALT | ||||||
| AUROC (95% CI) | 0.665 (0.631–0.697) | 0.815 (0.766–0.862) | 0.714 (0.668–0.759) | 0.603 (0.54–0.657) | 0.684 (0.619–0.75) | 0.667 (0.597–0.728) |
| AUPR (95% CI) | 0.714 (0.685–0.747) | 0.579 (0.496–0.674) | 0.582 (0.516–0.654) | 0.849 (0.819–0.875) | 0.641 (0.573–0.721) | 0.648 (0.583–0.727) |
| Cutoff value 1 (sensitivity (%)/specificity (%)/F1 (%))*** | 0.8 (48.1/84.8/54.2) | 0.8 (78.5/42.5/66.7) | ||||
| Cutoff value 2 (sensitivity (%)/specificity (%)/F1 (%))*** | 1 (33.1/92.8/45.1) | 1 (65.9/59/63.8) | ||||
| Comparison of AUROC | ||||||
| Metabolite marker panel versus FIB-4**** | 0.01 | |||||
| Metabolite marker panel versus APRI**** | ||||||
| Metabolite marker panel versus AST/ALT**** | ||||||
#95% CI was calculated using 1000 times bootstrap resampling on ROC and PR curves. CI confidence interval
†Cutoff values were determined to maximize the sum of sensitivity and specificity for the cohort 1 training dataset
*Predetermined cutoff values of FIB-4 were used (1.45 and 3.25 to distinguish extensive fibrosis)
**Predetermined cutoff values of APRI were used (1.0 and 2.0 to distinguish cirrhosis)
***Predetermined cutoff values of AST/ALT were used (0.8 and 1.0 to distinguish extensive fibrosis)
APRI AST-to-platelet ratio index, AST/ALT aspartate transaminase/alanine transaminase ratio, FIB-4 fibrosis-4 index, AUROC area under the receiver operating characteristic curve, AUPR area under the precision-recall (PR) curve
****Comparisons of AUROC between biomarker panel vs. FIB-4, AST/ALT, or APRI were performed using DeLong’s test
Fig. 4Model 2 for differentiating fibrosis vs. cirrhosis and model 3 for differentiating early fibrosis vs. advanced fibrosis in CLD patients with chronic HBV infection. a ROC curves of model 2 (RF model constructed with four metabolite markers and age), APRI, AST/ALT, and FIB-4 in cohort 1. b PR curves of model 2, APRI, AST/ALT, and FIB-4 in cohort 1. c ROC curves for model 2, APRI, AST/ALT, and FIB-4 for the cohort 2 validation set. d PR curves for model 2, APRI, AST/ALT, and FIB-4 for the cohort 2 validation set. e The RF score in CLD patients with fibrosis and cirrhosis in the HBV training, validation sets. The optimal cutoff value of the RF score was 0.01. f ROC curves of model 3 (RF model constructed with four metabolite markers and age), APRI, AST/ALT, and FIB-4 in cohort 1. g PR curves of model 3, APRI, AST/ALT, and FIB-4 in cohort 1. h ROC curves for model 3, APRI, AST/ALT, and FIB-4 for the cohort 2 validation set. i PR curves for model 3, APRI, AST/ALT, and FIB-4 for the cohort 2 validation set. j The RF score in CLD patients with S0–2 and S3–4 in the HBV training, validation sets. The optimal cutoff value of the RF score was − 0.115. ***p < 0.001, Wilcoxon’s rank-sum test