| Literature DB >> 32626836 |
Galen Miller-Atkins1, Lou-Anne Acevedo-Moreno2, David Grove3, Raed A Dweik4, Adriano R Tonelli4, J Mark Brown5,6, Daniela S Allende7, Federico Aucejo2, Daniel M Rotroff1.
Abstract
Hepatocellular carcinoma (HCC) and secondary liver tumors, such as colorectal cancer liver metastases are significant contributors to the overall burden of cancer-related morality. Current biomarkers, such as alpha-fetoprotein (AFP) for HCC, result in too many false negatives, necessitating noninvasive approaches with improved sensitivity. Volatile organic compounds (VOCs) detected in the breath of patients can provide valuable insight into disease processes and can differentiate patients by disease status. Here, we investigate whether 22 VOCs from the breath of 296 patients can distinguish those with no liver disease (n = 54), cirrhosis (n = 30), HCC (n = 112), pulmonary hypertension (n = 49), or colorectal cancer liver metastases (n = 51). This work extends previous studies by evaluating the ability for VOC signatures to differentiate multiple diseases in a large cohort of patients. Pairwise disease comparisons demonstrated that most of the VOCs tested are present in significantly different relative abundances (false discovery rate P < 0.1), indicating broad impacts on the breath metabolome across diseases. A predictive model developed using random forest machine learning and cross validation classified patients with 85% classification accuracy and 75% balanced accuracy. Importantly, the model detected HCC with 73% sensitivity compared with 53% for AFP in the same cohort. An added value of this approach is that influential VOCs in the predictive model may provide insight into disease etiology. Acetaldehyde and acetone, both of which have roles in tumor promotion, were considered important VOCs for differentiating disease groups in the predictive model and were increased in patients with cirrhosis and HCC compared to patients with no liver disease (false discovery rate P < 0.1).Entities:
Year: 2020 PMID: 32626836 PMCID: PMC7327218 DOI: 10.1002/hep4.1499
Source DB: PubMed Journal: Hepatol Commun ISSN: 2471-254X
FIG. 1Workflow diagram for data analysis and predictive modeling. Abbreviation: MS, mass spectrometry.
Study Cohort Summary
| Characteristic | Cirrhosis | HCC | CRLM | Healthy Control | Pulmonary Hypertension |
|---|---|---|---|---|---|
| Total (n) | 30 | 112 | 51 | 54 | 49 |
| Mean age (min‐max) | 59.6 (37‐79) | 66.7 (25‐95) | 55.9 (26‐82) | 58.8 (36‐80) | 61.9 (45‐85) |
| Sex | |||||
| Male (%) | 14 (46%) | 84 (75%) | 27 (50%) | 27 (50%) | 14 (29%) |
| Female (%) | 16 (53%) | 28 (25%) | 14 (26%) | 14 (26%) | 35 (71%) |
| Race | |||||
| Caucasian (%) | 25 (83%) | 80 (71%) | |||
| Black (%) | 3 (10%) | 21 (19%) | |||
| Hispanic (%) | 1 (3%) | 1 (1%) | |||
| Other (%) | 1 (3%) | 4 (4%) | 2 (4%) | ||
| Mean BMI (min‐max) | 27.66 (12.54‐40.17) | 28.61 (15.6‐50.76) | 26.3 (17.71‐47.46) | ||
| Cirrhosis | |||||
| Yes (%) | 30 (100%) | 75 (67%) | 1 (2%) | ||
| NASH (%) | 2 (6%) | 14 (13%) | 0 | ||
| EtOH (%) | 8 (26%) | 26 (23%) | 0 | ||
| HCV (%) | 8 (26%) | 52 (46%) | 0 | ||
| HBV (%) | 2 (6%) | 2 (2%) | 0 | ||
| Hemochromatosis (%) | 4 (13%) | 0 | |||
| Alpha 1 antitypsin deficiency (%) | 1 (1%) | 0 | |||
| Wilson’s disease (%) | 0 | ||||
| Other (%) | 9 (90%) | 15 (13%) | 1 (2%) | ||
| No (%) | 31 28%) | 48 (94%) | |||
| Child‐Pugh score | |||||
| 1 | 1 (3%) | ||||
| 5 | 1 (3%) | 8 (7%) | 14 (27%) | ||
| 6 | 3 (3%) | 1 (1%) | |||
| 7 | 1 (1%) | 1 (1%) | |||
| 8 | 1 (1%) | ||||
| Diabetes mellitus (%) | 10 (33%) | 37 (33%) | 1 (1%) | ||
| Hypertension (%) | 14 (46%) | 62 (55%) | 12 (24%) | ||
| Coronary artery disease (%) | 5 (16%) | 17 (15%) | 2 (2%) | ||
| Hyperlipidemia (%) | 8 (26%) | 28 (25%) | 6 (12%) | ||
| Psychiatric disorder (%) | 10 (33%) | 14 (13%) | 5 (10%) | ||
| Other cancer history | |||||
| B‐cell lymphoma (%) | 2 (6%) | ||||
| Granulosa cell tumor (%) | 1 (3%) | ||||
| COPD/Asthma/OSA | 8 (26%) | 22 (20%) | 5 (10%) | ||
| Thyroid | 4 (13%) | 8 (7%) | 2 (2%) | ||
| Other PH | 3 (10%) | 43 (38%) | 16 (31%) | ||
| Encephalopathy | |||||
| Grade 1‐2 (%) | 27 (90%) | 1 (1%) | 1 (2%) | ||
| Grade 3‐4 (%) | 1 (3%) | ||||
| None (%) | 1 (3%) | 103 (92%) | 43 84%) | ||
| Mean hemoglobin (SEM) | 11.6 (0.403) | 12.7 (0.222) | 12.1 (0.339) | ||
| Mean platelets (SEM) | 142 (15.8) | 180 (10.5) | 188 (11.9) | ||
| Mean ALP (SEM) | 163 (20.6) | 141 (9.7) | 142 (19.7) | ||
| Mean AST (SEM) | 43.8 (3.77) | 78.6 (10.7) | 71.9 (24.2) | ||
| Mean ALT (SEM) | 32 (4.46) | 76.3 (11.7) | 58.4 (17.4) | ||
| Mean bilirubin (SEM) | 1.12 (0.17) | 0.998 (0.09) | 0.929 (0.259) | ||
| Mean albumin (SEM) | 3.52 (0.11) | 3.58 (0.06) | 3.97 (0.07) | ||
| Mean INR (SEM) | 1.2 (0.09) | 1.19 (0.04) | 1.10 (0.04) | ||
| Mean glucose (SEM) | 122 (9.81) | 128 (6.61) | 107 (4.78) | ||
| Mean creatinine (SEM) | 1.51 (0.26) | 1.02 (0.06) | 0.863 (0.06) | ||
Sample size (n) and summary statistics are presented, grouped by diagnosis.
Abbreviations: ALP, alkaline phosphatase; ALT, alanine aminotransferase; AST, aspartate aminotransferase; BMI, body mass index; COPD, chronic obstructive pulmonary disease; EtOH, alcoholic hepatitis; HBV, hepatitis B virus; HCV, hepatitis C virus; INR, international normalized ratio; OSA, obstructive sleep apnea; PH, pulmonary hypertension.
Top 3 Association Results Between Metabolites and Each Pairwise Combination of Disease Statuses
| Comparison | N | Metabolite | Mean Difference in Relative Abundance | Log (Odds Ratio [SEM]) | OR (95% CI) |
| FDR |
|---|---|---|---|---|---|---|---|
| Healthy vs. HCC | 136 | (E)‐2‐Nonene | 0.35 | 7.4 (1.4) | 1,600 (100‐25,000) | 1.53 × 10−7 | 3.36 × 10−6 |
| Ethane | 15.52 | 9.4 (2) | 12,000 (250‐560,000) | 1.84 × 10−6 | 1.69 × 10−5 | ||
| Benzene | 1.13 | 2.3 (0.49) | 10 (3.9‐27) | 2.3 × 10−6 | 1.69 × 10−5 | ||
| Healthy versus cirrhosis | 77 | Methylhexane | 10.07 | 6.3 (1.3) | 540 (40‐7,300) | 2.12 × 10−6 | 2.65 × 10−5 |
| Decene | 0.49 | 6.8 (1.5) | 900 (52‐15,000) | 2.73 × 10−6 | 2.65 × 10−5 | ||
| Acrylonitrile | 0.39 | 5.7 (1.2) | 310 (27‐3,700) | 3.99 × 10−6 | 2.65 × 10−5 | ||
| Healthy versus PH | 85 | (E)‐2‐Nonene | 0.26 | 5.7 (1.2) | 310 (27‐3,600) | 3.77 × 10−6 | 8.29 × 10−5 |
| Acetaldehyde | 9.92 | 6.9 (1.6) | 1000 (47‐23,000) | 1.11 × 10−5 | 8.87 × 10−5 | ||
| Ethane | 10.85 | 8.2 (1.9) | 3500 (91‐140,000) | 1.12 × 10−5 | 8.87 × 10−5 | ||
| Healthy versus CRLM | 97 | (E)‐2‐Nonene | 0.35 | 2.8 (0.67) | 16 (4.3‐59) | 3.29 × 10−5 | 0.0007238 |
| Acetaldehyde | 2.31 | 3.6 (1) | 8 (5.3‐270) | 0.0003 | 0.003355 | ||
| Triethyl amine | 0.07 | 3 (0.89) | 21 (3.6‐120) | 0.00066 | 0.00484 | ||
| Cirrhosis versus HCC | 128 | Acetone |
|
| 0.267 (0.14‐0.5) | 3.53 × 10−5 | 0.0006 |
| Acetaldehyde |
|
| 0.13 (0.048‐0.35) | 6.5 × 10−5 | 0.0006 | ||
| Dimethyl Sulfide |
|
| 0.25 (0.12‐0.5) | 8.19 × 10−5 | 0.0006 | ||
| HCC versus CRLM | 136 | Isoprene |
|
| 0.21 (0.085‐0.54) | 0.00117 | 0.013 |
| Pentane |
|
| 0.12 (0.031‐0.43) | 0.00131 | 0.013 | ||
| Acetone |
|
| 0.39 (0.22‐0.72) | 0.0024 | 0.013 | ||
| Cirrhosis versus CRLM | 71 | Methylhexane |
|
| 0.0076 (0.0007‐0.088) | 9.31 × 10−5 | 0.0008 |
| Isoprene |
|
| 0.041 (0.008‐0.21) | 0.00015 | 0.0008 | ||
| Trimethyl amine |
|
| 0.062 (0.014‐0.26) | 0.000167 | 0.0008 |
Sample sizes reflect data after removing outliers and missing data.
Age was included as a covariate.
Sex was included as a covariate.
Age and sex were included as covariates.
Age, hypertension, and albumin were included as covariates.
Albumin and diabetes mellitus were included as covariates.
Abbreviations: CI, confidence interval; OR, odds ratio; PH, portal hypertension.
FIG. 2Bar plots for random forest results grouped by disease status and model type: age and sex only, metabolites only, or metabolites, age, and sex. Disease status is color‐coded (pink, pulmonary hypertension; white, healthy; gray, HCC; green, CRLM; and red, cirrhosis).
Random Forest Results Grouped by Model (Metabolite, Age, and Sex; Metabolite Only; or Age and Sex Only) and Disease Status
| Model | Disease Status | Classification Accuracy | Sensitivity | Specificity | Balanced Accuracy |
|---|---|---|---|---|---|
| Metabolites, age, and sex | Cirrhosis | 0.90 | 0.40 | 0.96 | 0.68 |
| CRLM | 0.86 | 0.51 | 0.94 | 0.72 | |
| HCC | 0.72 | 0.73 | 0.71 | 0.72 | |
| Healthy | 0.93 | 0.76 | 0.97 | 0.86 | |
| Pulmonary hypertension | 0.86 | 0.58 | 0.93 | 0.75 | |
| Average | 0.85 | 0.60 | 0.90 | 0.75 | |
| Metabolites only | Cirrhosis | 0.91 | 0.40 | 0.96 | 0.68 |
| CRLM | 0.78 | 0.28 | 0.89 | 0.58 | |
| HCC | 0.63 | 0.62 | 0.64 | 0.63 | |
| Healthy | 0.88 | 0.61 | 0.94 | 0.77 | |
| Pulmonary hypertension | 0.83 | 0.51 | 0.90 | 0.71 | |
| Average | 0.81 | 0.48 | 0.87 | 0.68 | |
| Age and sex only | Cirrhosis | 0.90 | 0.00 | 1.00 | 0.50 |
| CRLM | 0.81 | 0.37 | 0.90 | 0.64 | |
| HCC | 0.70 | 0.78 | 0.65 | 0.72 | |
| Healthy | 0.86 | 0.28 | 1.00 | 0.64 | |
| Pulmonary hypertension | 0.73 | 0.56 | 0.77 | 0.66 | |
| Average | 0.80 | 0.40 | 0.86 | 0.63 |
FIG. 3Bar plots for mean decrease in Gini scores for each variable in the random forest model from metabolites, age, and sex. Higher scores denote more importance to the model results (e.g., ethane is considered the most important VOC).