| Literature DB >> 36012106 |
Svenja Sydor1, Christian Dandyk2, Johannes Schwerdt3, Paul Manka1, Dirk Benndorf2,4, Theresa Lehmann2, Kay Schallert2,4,5, Maximilian Wolf6, Udo Reichl2,4, Ali Canbay1, Lars P Bechmann1, Robert Heyer2,4,5,6.
Abstract
High-calorie diets lead to hepatic steatosis and to the development of non-alcoholic fatty liver disease (NAFLD), which can evolve over many years into the inflammatory form of non-alcoholic steatohepatitis (NASH), posing a risk for the development of hepatocellular carcinoma (HCC). Due to diet and liver alteration, the axis between liver and gut is disturbed, resulting in gut microbiome alterations. Consequently, detecting these gut microbiome alterations represents a promising strategy for early NASH and HCC detection. We analyzed medical parameters and the fecal metaproteome of 19 healthy controls, 32 NASH patients, and 29 HCC patients, targeting the discovery of diagnostic biomarkers. Here, NASH and HCC resulted in increased inflammation status and shifts within the composition of the gut microbiome. An increased abundance of kielin/chordin, E3 ubiquitin ligase, and nucleophosmin 1 represented valuable fecal biomarkers, indicating disease-related changes in the liver. Although a single biomarker failed to separate NASH and HCC, machine learning-based classification algorithms provided an 86% accuracy in distinguishing between controls, NASH, and HCC. Fecal metaproteomics enables early detection of NASH and HCC by providing single biomarkers and machine learning-based metaprotein panels.Entities:
Keywords: fecal microbiota; hepatocellular carcinoma; machine learning; metaproteomics; non-alcoholic steatohepatitis
Mesh:
Substances:
Year: 2022 PMID: 36012106 PMCID: PMC9408600 DOI: 10.3390/ijms23168841
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 6.208
Figure 1Multilayer PCA of clinical parameters, human and metaproteins, and family-level taxonomy for all samples. The individual PCAs were visualized on the left, and same samples were connected across all layers. The associated biplots with the top 10 loadings were shown on the right side. For human and microbial metaproteins, the metaproteins with at least 0.01% of the total spectral count were selected. In contrast, for families and clinical parameters, all 419 and 30 were chosen, respectively. For better readability, the top ten human and microbial loadings (metaproteins) were summarized in a table below the plot.
Figure 2Abundance of the family Hominidae and the ratio of Bacteriodetes to Firmicutes. The abundance is based on the normalized abundance of identified spectra.
Figure 3Summary of selected changed features. Significance was evaluated by the Kruskal–Wallis test using a p-value cutoff smaller than 0.01 for families and metabolic functions and smaller than 10−5 for metaproteins.
Potential biomarker metaproteins between controls and diseased patients. We analyzed the ROC plot analysis for the most abundant ten metaproteins and summarized the area under the curve to evaluate metaprotein biomarkers.
| Metaproteins | #SpecAb | Area under Curve |
|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Figure 4Machine learning-based sample classification between NASH, HCC, and controls. (A) shows the workflow of the software for feature selection, feature wrapping, and development of the classification algorithms. (B) shows display of the confusion matrix of the best-observed classifier (linear discriminant analysis.). Cross-validation ensured that the patient numbers in the confusion matrix were real and not natural numbers. The evaluation was performed by averaging a 5-fold cross-validation on 10,000 repeats. (C) shows the clustering of the data and their intrinsic similarity using Ward linkage and Canberra distances.
Classification accuracy for NASH, HCC, and controls. Results were obtained by the given number of features and the algorithms by averaging a 5-fold cross-validation on 10,000 repeats.
| Comparison | Accuracy | Number of Features |
|---|---|---|
| NASH vs. Control | 0.9998 | 7 features |
| HCC vs. Control: | 1 | 5 features |
| HCC vs. NASH | 0.8640 | 10 features |
| HCC vs. NASH vs. Control | 0.86 | 11 features |