| Literature DB >> 35050163 |
Bei Gao1,2, Tsung-Chin Wu3,4, Sonja Lang2, Lu Jiang2,5, Yi Duan2, Derrick E Fouts6, Xinlian Zhang4, Xin-Ming Tu4, Bernd Schnabl2,5.
Abstract
Alcoholic hepatitis is a major health care burden in the United States due to significant morbidity and mortality. Early identification of patients with alcoholic hepatitis at greatest risk of death is extremely important for proper treatments and interventions to be instituted. In this study, we used gradient boosting, random forest, support vector machine and logistic regression analysis of laboratory parameters, fecal bacterial microbiota, fecal mycobiota, fecal virome, serum metabolome and serum lipidome to predict mortality in patients with alcoholic hepatitis. Gradient boosting achieved the highest AUC of 0.87 for both 30-day mortality prediction using the bacteria and metabolic pathways dataset and 90-day mortality prediction using the fungi dataset, which showed better performance than the currently used model for end-stage liver disease (MELD) score.Entities:
Keywords: machine learning; metabolomics; microbiota; mycobiome; virome
Year: 2022 PMID: 35050163 PMCID: PMC8781791 DOI: 10.3390/metabo12010041
Source DB: PubMed Journal: Metabolites ISSN: 2218-1989
Characteristics of patients with alcoholic hepatitis.
| Clinical Parameters | Alcoholic Hepatitis ( | |
|---|---|---|
| Sex (% male), | 138 (66.3%) | |
| Age (years), | 49.3 (26.4–74.8) | |
| BMI (kg/m2), | 28.2 (16.2–48.3) | |
| Laboratory parameter | ||
| Creatinine (mg/dL), | 1.1 (0.3–8.1) | |
| Bilirubin (mg/dL), | 16.0 (2.5–51.8) | |
| AST (IU/L), | 165.2 (34.0–1858.0) | |
| ALT (IU/L), | 57.5 (14.0–404.0) | |
| Albumin (g/dL), | 2.6 (1.1–4.2) | |
| INR, | 1.9 (0.8–7.6) | |
| GGT (IU/L), | 602.6 (33.0–3650.0) | |
| Platelet count (109/L), | 136.5 (12.2–447.0) | |
| Alkaline phosphatase (U/L), | 207.6 (21.2–1153.0) | |
| Prothrombin time, s, | 29.3 (9.0–141.0) | |
| Sodium (mEq/L), | 133.0 (106.0–148.0) | |
| FIB-4, | 11.5 (0.7–116.0) | |
| FIB-4 > 3.25 (F3-F4), | 171 (90.5%) | |
| Treatment at admission | ||
| Steroids, | 83 (43.0%) | |
| Antibiotics, | 53 (27.5%) | |
| Proton pump inhibitors, | 13 (13.3%) | |
| Infections, | 41 (25.9%) | |
| Clinical scores and outcome | ||
| MELD, median (range), | 26.8 (10.1–48.6) | |
| MELD > 21, | 158 (82.3%) | |
| 30-day mortality ( | 31 (14.8%) | |
| 90-day mortality ( | 54 (34.2%) | |
| Histology | ||
| Liver biopsy available, | 120 (60.9%) | |
| Stage of fibrosis, | 0 | 3 (2.5%) |
| 1 | 3 (2.5%) | |
| 2 | 15 (12.7%) | |
| 3 | 18 (15.3%) | |
| 4 | 79 (66.9%) |
Note: Values presented are median with range in parentheses for continuous variables or number and percentage in parentheses for categorical variables. Percentages were calculated based on the actual number of patients in each group, when data were available. The number of subjects for which data were available is indicated in the first column. MELD, model for end-stage liver disease; BMI, body mass index; AST, aspartate aminotransferase; ALT, alanine aminotransferase; INR, international normalized ratio; GGT, gamma-glutamyl transferase; FIB-4, fibrosis-4 index. Fibrosis stage, 0 no fibrosis, 1 portal fibrosis, 2 expansive periportal fibrosis, 3 bridging fibrosis, 4 cirrhosis.
Figure 1Mortality prediction with clinical parameters in patients with alcoholic hepatitis. (A) A total of 210 patients were included in this study. 31 died within 30-day and 179 patients were alive, 23 patients died within 90 days. A total of 104 patients survived at 90 days. The remaining 52 patients were lost to follow-up. Blue: alive. Red: deceased. Grey: unknown status. (B) Survival probability within 90 days. Confidence intervals are shown in light blue. +: Patients lost to follow-up were censored at the time they were last seen alive. (C) Prediction of 30-day (red line, AUC = 0.78) and 90-day mortality using MELD score (purple line, AUC = 0.82). (D) Four models for 30-day mortality prediction in patients with alcoholic hepatitis using clinical data. Day-30 deceased group n = 31, alive group n = 179. (E) Four models for 90-day mortality prediction in patients with alcoholic hepatitis using clinical data. Day-90 deceased group n = 54, alive group n = 104. GB: gradient boosting, LR: logistic regression, RF: random forest, SVM: support vector machine.
Variables with the top 11 average feature importance in each dataset.
|
|
|
| |
| 30-day | age | international normalized ratio | creatinine |
| creatinine | creatinine | international normalized ratio | |
| bilirubin | sodium | bilirubin | |
| albumin | PWY-6125: superpathway of guanosine nucleotides de novo biosynthesis II | sodium | |
| international normalized ratio | DTDPRHAMSYN-PWY: dTDP-L-rhamnose biosynthesis I | Aspergillus | |
| alanine transaminase | PWY-7229: superpathway of adenosine nucleotides de novo biosynthesis I | alkaline phosphatase | |
| alkaline phosphatase | PWY-7222: guanosine deoxyribonucleotides de novo biosynthesis II | age | |
| platelet count | PWY-7228: superpathway of guanosine nucleotides de novo biosynthesis I | aspartate transaminase | |
| white blood cell count | PANTO-PWY: phosphopantothenate biosynthesis I | platelets | |
| aspartate transaminase | PWY-6126: superpathway of adenosine nucleotides de novo biosynthesis II | alanine transaminase | |
| sodium | PWY-7220: adenosine deoxyribonucleotides de novo biosynthesis II | white blood cell count | |
| 90-day | age | international normalized ratio | creatinine |
| creatinine | creatinine | bilirubin | |
| bilirubin | bilirubin | sodium | |
| albumin | sodium | international normalized ratio | |
| international normalized ratio | PWY-7229: superpathway of adenosine nucleotides de novo biosynthesis I | age | |
| alanine transaminase | DTDPRHAMSYN-PWY: dTDP-L-rhamnose biosynthesis I | albumin | |
| alkaline phosphatase | GLUCOSE1PMETAB-PWY: glucose and glucose-1 phosphate degradation | alkaline phosphatase | |
| platelet count | PWY-5989: stearate biosynthesis II bacteria and plants | aspartate transaminase | |
| white blood cell count | Clostridium nexile | platelets | |
| aspartate transaminase | PWY0-1297: superpathway of purine deoxyribonucleosides degradation | white blood cell count | |
| sodium | PWY-6125: superpathway of guanosine nucleotides de novo biosynthesis II | alanine transaminase | |
|
|
|
| |
| 30-day | creatinine | creatinine | creatinine |
| international normalized ratio | international normalized ratio | L-Hydroxyarginine | |
| bilirubin | bilirubin | bilirubin | |
| sodium | Zonulin | Urea | |
| Epsilon15 virus | albumin | Pseudo uridine | |
| aspartate transaminase | sodium | Maltose | |
| P22 virus | Lipopolysaccharide binding protein | Erythritol | |
| Lambda virus | Anti-Saccharomyces cerevisiae antibodies | Metabolite creatinine | |
| alkaline phosphatase | age | S-adenosyl homocysteine | |
| alanine transaminase | platelets | L-Carnitine | |
| age | alanine transaminase | Adenine | |
| 90-day | creatinine | creatinine | creatinine |
| international normalized ratio | international normalized ratio | international normalized ratio | |
| bilirubin | bilirubin | L-Homocitrulline | |
| sodium | age | Lyxitol | |
| age | Anti-Saccharomyces cerevisiae antibodies | Pseudo uridine | |
| albumin | sodium | L-Hydroxyarginine | |
| alanine transaminase | white blood cell count | Adipoyl-L-carnitine | |
| alkaline phosphatase | Zonulin | Asymmetric dimethylarginine | |
| Epsilon15 virus | albumin | sodium | |
| platelets | Lipopolysaccharide binding protein | Acyl carnitines | |
| white blood cell count | alkaline phosphatase | Kynurenic acid |
Summary of AUC scores for each dataset.
| Model | Clinical Data | Clinical Data + Bacteria + Metacyc Pathways | Clinical Data + Fungi | Clinical Data + Virus | Clinical Data + Metabolites + Lipids | Clinical Data + ELISA | |
|---|---|---|---|---|---|---|---|
| 30-day Mortality | LR(MELD) | 0.78 | 0.79 | 0.72 | 0.72 | 0.77 | 0.77 |
| LR | 0.74 | 0.59 | 0.56 |
| 0.65 | 0.72 | |
| SVM | 0.76 | 0.68 |
|
| 0.74 | 0.74 | |
| RF |
|
| 0.45 |
| 0.73 | 0.76 | |
| GB |
|
| 0.35 | 0.69 | 0.71 | 0.69 | |
| 90-day Mortality | LR(MELD) | 0.82 | 0.63 | 0.25 | 0.67 | 0.83 | 0.79 |
| LR | 0.80 |
|
| 0.62 | 0.67 | 0.64 | |
| SVM | 0.80 |
|
| 0.63 | 0.69 | 0.64 | |
| RF | 0.80 | 0.54 |
| 0.53 | 0.78 | 0.71 | |
| GB | 0.79 | 0.47 |
| 0.58 | 0.71 | 0.71 |
Note: LR: logistic regression, SVM: support vector machine, RF: random forest, GB: gradient boosting, LR (MELD): logistic regression model using MELD score only based on the same subset of patients as each dataset. Clinical data day-30 deceased group n = 31, alive group n = 179. Day-90 deceased group n = 54, alive group n = 104. Clinical data, bacteria and MetaCyc pathways day 30: deceased group n = 8; alive group n = 65. Day 90: deceased group n = 13; alive group n = 40. Clinical data and fungi day 30: deceased group n = 5; alive group n = 49. Day 90: deceased group n = 9; alive group n = 30. Clinical data and virus day 30: deceased group n = 8; alive group n = 68. Day 90: deceased group n = 14; alive group n = 42. Clinical data, metabolites and lipids day 30: deceased group n = 19; alive group n = 99. Day 90: deceased group n = 33; alive group n = 57. Clinical data and serum biomarkers day 30: decreased group n = 20; alive group n = 118. Day 90: deceased group n = 36; alive group n = 68. Bold: ROC score higher than LR (MELD) model in each dataset. Italic: highest AUC for 30-day or 90-day mortality.
Figure 2Multi-omics datasets used for mortality prediction in patients with alcoholic hepatitis. A total of (1) 529 fecal bacterial species and 472 MetaCyc pathways, (2) 81 fecal fungal genera, (3) 83 fecal viral genera, (4) 447 serum metabolites and 287 serum lipids were included.
Figure 330- and 90-day mortality prediction using fecal bacteria, MetaCyc pathways and clinical data in patients with alcoholic hepatitis. (A) 30- and 90-day mortality prediction using MELD score. (B) 30-day mortality prediction using fecal bacteria, Metacyc pathways and clinical data. Deceased group n = 8; alive group n = 65. (C) 90-day mortality prediction using fecal bacteria, Metacyc pathways and clinical data. Deceased group n = 13; alive group n = 40. GB: gradient boosting, LR: logistic regression, RF: random forest, SVM: support vector machine.
Figure 430- and 90-day mortality prediction using fecal fungi and clinical data in patients with alcoholic hepatitis. (A) 30- and 90-day mortality prediction using MELD score. (B) 30-day mortality prediction using fecal fungi and clinical data. Deceased group n = 5; alive group n = 49. (C) 90-day mortality prediction using fecal fungi and clinical data. Deceased group n = 9; alive group n = 30. GB: gradient boosting, LR: logistic regression, RF: random forest, SVM: support vector machine.
Figure 530- and 90-day mortality prediction using fecal viruses and clinical data in patients with alcoholic hepatitis. (A) 30- and 90-day mortality prediction using MELD score. (B) 30-day mortality prediction using fecal viruses and clinical data. Deceased group n = 8; alive group n = 68. (C) 90-day mortality prediction using fecal viruses and clinical data. Deceased group n = 14; alive group n = 42. GB: gradient boosting, LR: logistic regression, RF: random forest, SVM: support vector machine.
Figure 630- and 90-day mortality prediction using serum metabolites, serum lipids and clinical data in patients with alcoholic hepatitis. (A) 30- and 90-day mortality prediction using MELD score. (B) 30-day mortality prediction using serum metabolites, serum lipids and clinical data. Deceased group n = 19; alive group n = 99. (C) 90-day mortality prediction using serum metabolites, serum lipids and clinical data. Deceased group n = 33; alive group n = 57. GB: gradient boosting, LR: logistic regression, RF: random forest, SVM: support vector machine.
Figure 730- and 90-day mortality prediction using serum biomarkers and clinical data in patients with alcoholic hepatitis. (A) 30- and 90-day mortality prediction using MELD score. (B) 30-day mortality prediction using serum biomarkers and clinical data. Decreased group n = 20; alive group n = 118. (C) 90-day mortality prediction using serum biomarkers and clinical data. Decreased group n = 36; alive group n = 68. GB: gradient boosting, LR: logistic regression, RF: random forest, SVM: support vector machine.