| Literature DB >> 31992752 |
Daniel Veyel1, Kathrin Wenger1, Andre Broermann2, Tom Bretschneider1, Andreas H Luippold1, Bartlomiej Krawczyk1, Wolfgang Rist3, Eric Simon4.
Abstract
Nonalcoholic steatohepatitis (NASH) is a major cause of liver fibrosis with increasing prevalence worldwide. Currently there are no approved drugs available. The development of new therapies is difficult as diagnosis and staging requires biopsies. Consequently, predictive plasma biomarkers would be useful for drug development. Here we present a multi-omics approach to characterize the molecular pathophysiology and to identify new plasma biomarkers in a choline-deficient L-amino acid-defined diet rat NASH model. We analyzed liver samples by RNA-Seq and proteomics, revealing disease relevant signatures and a high correlation between mRNA and protein changes. Comparison to human data showed an overlap of inflammatory, metabolic, and developmental pathways. Using proteomics analysis of plasma we identified mainly secreted proteins that correlate with liver RNA and protein levels. We developed a multi-dimensional attribute ranking approach integrating multi-omics data with liver histology and prior knowledge uncovering known human markers, but also novel candidates. Using regression analysis, we show that the top-ranked markers were highly predictive for fibrosis in our model and hence can serve as preclinical plasma biomarkers. Our approach presented here illustrates the power of multi-omics analyses combined with plasma proteomics and is readily applicable to human biomarker discovery.Entities:
Mesh:
Substances:
Year: 2020 PMID: 31992752 PMCID: PMC6987209 DOI: 10.1038/s41598-020-58030-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Transcriptomic characterization of the rat CDAA model. (a) Overview of experimental layout for multi-omics model characterization. (b) Principal component analysis scores plot of RNA-Seq data from liver of weeks 4, 8, and 12 of CSAA and CDAA diet. (c) Number of deregulated genes (FC > |1|, Benjamini-Hochberg adj. p value < 0.01) at different time points as bar diagram and Venn diagram. (d) Hierarchical clustering of z-scored gene expression ratio time profiles. Overrepresentation analysis of Gene Ontology (GO) terms Biological Process (BP), Cellular Component (CC), and Molecular Function (MF) in clusters was done using Fisher’s exact test (Benjamini-Hochberg adj. p value). Shown here are the two most significant categories (category size <2000 genes, enrichment factor >1, intersection size >7 genes). Supplementary Table 1 contains the full result table. (e) Hepatotoxicity functional overrepresentation analysis from IPA for comparison of different time points (Benjamini-Hochberg adj. p value < 0.01, z-score > |0.75|).
Figure 2Proteomics analysis of the CDAA model at 12 weeks and comparison to transcriptomic data. (a) Volcano plots of protein changes observed in liver (left) and plasma (right). Significant changes are colored (T-test, permutation based FDR < 1%). (b) Venn diagram showing the overlap of liver and plasma proteomics data. Lower plot: corresponding log2 fold changes of liver and plasma proteins. Significantly changing proteins in both matrices are colored. (c) Correlation of individual transcript to protein log2 fold changes at week 12. Significant changes on both levels are colored. (d) Top two overrepresented sets of anti-regulated features on transcript and protein level (n = 49, Benjamini-Hochberg adj. p value < 0.05). Statistical overrepresentation of Gene Ontology (GO) terms (GO Database released 2019-01-01) for Molecular Function (GO MF), Cellular Component (GO CC), and Biological Process (GO BP) was tested against all overlapping features as reference list with the Panther online tool[68] (http://pantherdb.org/).
Figure 3Comparison of rat CDAA to human NASH mRNA datasets on pathway level. Analysis match of overrepresentation analyses of canonical pathways using IPA of all rat data and three selected human datasets. Data are filtered to give at least one hit in a human study and one rat CDAA time point with Benjamini-Hochberg adj. p value < 0.05. Left panel: Signaling pathways, right panel: metabolic pathways (see Supplementary Table S2 for details). Note: Due to different reference datasets, the p values of the analyses are not directly comparable.
Multi-dimensional ranking dimensions and their subscores.
| Dimension | Subscore (equally weighted) |
|---|---|
| Plasma specificity | Plasma Protein Differential Expression (dge score) |
| Plasma Protein - Histo Correlation (r2) | |
| Liver Specificity | Liver Protein Differential Expression (dge score) |
| Liver RNA Differential Expression (dge score) | |
| Liver Protein - Histo Correlation (r2) | |
| Liver RNA - Histo Correlation (r2) | |
| Prior Evidence | Association to Fibrosis (OpenTargets overall score) |
| Literature NASH biomarker (GeneRifs observed vs expected score) | |
| Literature NASH biomarker (Pubmed observed vs expected score) | |
| Patent NASH biomarker Somalogic (present = 1, 0 otherwise) | |
| NASH biomarker Integrity/MetaCore (present = 1, 0 otherwise) | |
| Protein Class | Secreted (secreted = 1, 0 otherwise) |
All proteins were scored according to four different dimensions using the sum of equally weighted subscores.
Figure 4Multi-dimensional attribute ranking for biomarker discovery. (a) Biomarker scoring scheme using the weighted sum of multiple normalized subscores (see Methods for description, Supplementary Table S3). (b) Ranking by total biomarker score using the default weight setting with the contribution of each subscore. The top 10 biomarkers are labeled by coding gene name. (c) Linear regression analysis of the top 10 ranked biomarkers (see b) using protein plasma intensity as predictor (x) for the first three components of the corresponding sample in the liver RNA and protein PCAs. PC1, which separates CDAA vs. CSAA samples, showed the best correlation to protein plasma intensity. Left heatmap: r2 values of regression analysis, right heatmap: p values of correlation. p values > 0.01 were shaded in grey. Clustering by r2 (d) Clustering of total scores obtained from the sensitivity analysis using six different weight settings. The corresponding weight settings are displayed at the top (grey-scale heatmap). The best rank for each protein in any of the weight settings is shown on the left (log2 of rank).
Top 10 biomarker candidates with annotated protein function, sign of regulation in liver on RNA level (LR), in liver on protein level (LP), and in plasma on protein level (PP) after 12 weeks of CDAA vs. CSAA diet.
| Gene Name | Protein Function (UniProt/Swissprot) | Regulation LR, LP, PP | Top2 enriched tissues (median fold change) |
|---|---|---|---|
| ADAMTSL2 | #N/A | Up, Up, Up | Adrenal Gland (5.6), Kidney (4.4) |
| C7 | Constituent of the membrane attack complex (MAC) that plays a key role in the innate and adaptive immune response by forming pores in the plasma membrane of target cells. | Up, Up, Up | Adrenal Gland (7.6), Ovary (3.2) |
| LGALS3BP | Promotes integrin-mediated cell adhesion. | Up, Up, Up | HSC - TGFb 2.5 ng (4.7), Stomach (2.9) |
| COL6A1 | Collagen VI acts as a cell-binding protein. | Up, Up, Up | HSC - TGFb 2.5 ng (11.1), HSC - Control (8.8) |
| APCS | Can interact with DNA and histones and may scavenge nuclear material released from damaged circulating cells. | Down, Down, Down | Liver (25.7), Gall Bladder (1.3) |
| CLU | Isoform 1 functions as extracellular chaperone that prevents aggregation of nonnative proteins. | NR,Up, Up | Cerebral Cortex (4.7), Liver (3.6) |
| COL6A2 | Collagen VI acts as a cell-binding protein. | Up, Up, Up | HSC - TGFb 2.5 ng (8.9), HSC - Control (7.3) |
| CPQ | Carboxypeptidase that may play an important role in the hydrolysis of circulating peptides. | Down, NR, Down | Thyroid Gland (6.8), Gall Bladder (2.0) |
| PIGR | This receptor binds polymeric IgA and IgM at the basolateral surface of epithelial cells. | Down, NR, Up | Duodenum (8.7), Colon (5.7) |
| PLTP | Facilitates the transfer of a spectrum of different lipid molecules […]. Essential for the transfer of excess surface lipids from triglyceride-rich lipoproteins to HDL, thereby facilitating the formation of smaller lipoprotein remnants, contributing to the formation of LDL, and assisting in the maturation of HDL particles. PLTP also plays a key role in the uptake of cholesterol from peripheral cells and tissues that is subsequently transported to the liver for degradation and excretion. Two distinct forms of PLTP exist in plasma: an active form that can transfer PC from phospholipid vesicles to high-density lipoproteins (HDL), and an inactive form that lacks this capability. | Up, Up, Up | Placenta (7.2), Gall Bladder (2.5) |
The column on the left shows the tissue specificity for each gene in a panel of RNA-Seq data from 27 normal tissues (ArrayExpress E-MTAB-1733) complemented by RNA-Seq data from stellate cells, i.e. the control group and a sample group treated with TGFb (Gene Expression Omnibus GSE78853). Enrichment factors correspond to the fold change of the median expression in enriched tissue vs. median of median across all normal tissues.
Weight sets of six different tested rankings.
| # | Dimension | Plasma Specificity | Liver Specificity | Prior Evidence | Secreted Protein Class |
|---|---|---|---|---|---|
| 1 | Default | 0.3 | 0.3 | 0.2 | 0.2 |
| 2 | Prior Knowledge | 0.0 | 0.0 | 1.0 | 0.0 |
| 3 | Balanced | 0.2 | 0.2 | 0.4 | 0.2 |
| 4 | Liver/Plasma specific | 0.5 | 0.5 | 0.0 | 0.0 |
| 5 | Liver specific | 0.0 | 1.0 | 0.0 | 0.0 |
| 6 | Plasma specific | 1.0 | 0.0 | 0.0 | 0.0 |
The individual rankings were done on the total scores obtained by the weighted sum of individual dimension scores. For each ranking, all weights listed in Table 3 sum up to 1.