| Literature DB >> 34031464 |
Tristan V de Jong1,2, Victor Guryev3,4, Yuri M Moshkin5,6,7.
Abstract
Finding novel biomarkers for human pathologies and predicting clinical outcomes for patients is challenging. This stems from the heterogeneous response of individuals to disease and is reflected in the inter-individual variability of gene expression responses that obscures differential gene expression analysis. Here, we developed an alternative approach that could be applied to dissect the disease-associated molecular changes. We define gene ensemble noise as a measure that represents a variance for a collection of genes encoding for either members of known biological pathways or subunits of annotated protein complexes and calculated within an individual. The gene ensemble noise allows for the holistic identification and interpretation of gene expression disbalance on the level of gene networks and systems. By comparing gene expression data from COVID-19, H1N1, and sepsis patients we identified common disturbances in a number of pathways and protein complexes relevant to the sepsis pathology. Among others, these include the mitochondrial respiratory chain complex I and peroxisomes. This suggests a Warburg effect and oxidative stress as common hallmarks of the immune host-pathogen response. Finally, we showed that gene ensemble noise could successfully be applied for the prediction of clinical outcome namely, the mortality of patients. Thus, we conclude that gene ensemble noise represents a promising approach for the investigation of molecular mechanisms of pathology through a prism of alterations in the coherent expression of gene circuits.Entities:
Year: 2021 PMID: 34031464 PMCID: PMC8144599 DOI: 10.1038/s41598-021-90192-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1H1N1 and sepsis coordinately affect mean gene expression and inter-individual gene expression variability. (A) Inter-individual variability in whole blood gene expression (σ) increases in sepsis/CAP (top), other sepsis (mid), and H1N1 (bottom) patients as compared to healthy individuals. p(1–0)—p-values of t-tests comparing differences in inter-individual gene expression variability of healthy individuals (control) with survived (sepsis) and early H1N1 infected patients. p(2–1)—p-values of t-tests comparing differences of survived (sepsis) and early H1N1 infected patients with deceased (sepsis) and late H1N1 infected patients. Circles and whiskers indicate means and standard deviations respectively. (B) Correlations between variances in whole blood gene expression (σ2) and absolute changes in mean gene expression (|Δμ|) for healthy individuals (ctl.) and patients (sepsis, H1N1). Due to the fluctuation-response relationship, the magnitude of the mean gene expression response depends on its variance. We estimated common variances for genes in healthy and sepsis/CAP patients (top), healthy and other sepsis patients (mid), and healthy and H1N1 patients (bottom). (C) Correlations between variances in whole blood gene expression (σ2) and absolute changes in inter-individual gene expression variability (|Δσ2|) for control individuals (ctl.) and patients (sepsis, H1N1). (D) Correlations between absolute changes in mean gene expression (|Δμ|) and in inter-individual gene expression variability (|Δσ2|).
Figure 2Association of gene ensemble noise with H1N1 infection phase and sepsis mortality. (A) Venn diagram of KEGG- and CORUM-annotated biological pathways/protein complexes for which gene ensemble noise associates positively (increases) and significantly with disease state (mild/severe for the COVID-19, healthy/early/late for the H1N1, and healthy/survived/deceased for sepsis/CAP and other sepsis patients). (B) Plots of gene ensemble noise for genes involved in HIF-1 signalling, peroxisome, necroptosis, NOD-like receptor, and Fc epsilon RI signalling pathways. Pathways were annotated by KEGG. Kendall tau, and FDR- (H1N1 patients) and Bonferroni- (sepsis patients) adjusted p-values are indicated. Rank-based regression trend lines and 95% confidence bands of gene ensemble noise association with the state of disease are shown. Black circles and whiskers indicate means and standard deviations. (C) Plots of gene ensemble noise for genes encoding CORUM-annotated subunits of mitochondrial respiratory chain complex I (subcomplex I alpha—top panel and nuclear-encoded subunits—bottom panel). Rank-based regression trend lines and 95% confidence bands of gene ensemble noise association with the state of the disease are shown. (D) Methylene Blue (MB) acts as an alternative electron donor to the electron transport chain (red arrows) by shuttling between redox states (MB—MBH2) and, thus, bypassing respiratory chain complex I. Respiratory chain complex I-IV and their substrates are indicated, Q—coenzyme Q10, CytC—cytochrome C. Electrons are indicated as yellow circles.
Role in sepsis of the pathways for which gene ensemble noise associates positively (increases) with the disease states (healthy < early/survived < late/deceased).
| Pathways/Complexes | Role in sepsis pathology | References |
|---|---|---|
| KEGG: HIF-1 signalling pathway | Metabolic reprogramming of innate immune cells during the hyperinflammatory and immunotolerant phases of sepsis | [ |
| KEGG: Peroxisome | Defective peroxisome recycling alters cellular redox homeostasis and leads to exaggerated oxidative stress response to endotoxin (infection) and sepsis | [ |
| KEGG: Necroptosis | Necroptosis is implicated in pulmonary diseases and sepsis-associated organ injury | [ |
| KEGG: NOD-like receptor signalling pathway | Activation of Toll-like and NOD-like receptor signalling protects mice from polymicrobial sepsis-associated lethality | [ |
| KEGG: Fc epsilon RI signalling pathway | Fc receptors bind to antibodies attached to invading pathogens and their up-regulation can serve a potential biomarker for sepsis. Mice deficient for FCER1G gene encoding the γ-subunit of Fc epsilon RI show increased resistance to sepsis | [ |
| KEGG: Autophagy—other | Autophagy is an adaptive protective process that eliminates damaged proteins, organelles and pathogens. It is thought to be a promising target in treatment of sepsis | [ |
| KEGG: Biosynthesis of amino acids | Sepsis results in significant disorders in amino acids metabolism | [ |
| KEGG: Glucagon signalling pathway | Glucagon levels negatively associate with clinical outcome in sepsis patients | [ |
| KEGG: Propanoate (propionate) metabolism | Propionic acidaemia caused by altered propionate metabolism often results in sepsis and death | [ |
| KEGG: Circadian rhythm | There is accumulating evidence for association between circadian misalignment and severity of inflammatory responses in sepsis | [ |
| KEGG: Dopaminergic synapse | Dopamine mediates neuroimmune communications and dopaminergic is implicated in inflammation and sepsis | [ |
| KEGG: Amyotrophic lateral sclerosis (ALS) | ALS patients often develop pulmonary insufficiency and have increased risk of sepsis | [ |
| CORUM: Respiratory chain complex I, mitochondrial | Mitochondrial disfunction resulting in reduced respiratory chain complex I activity and low ATP levels is a whole mark for sepsis | [ |
| KEGG: Osteoclast differentiation | Mean expression of osteoclast differentiation genes is up-regulated in human septic shock | [ |
| KEGG: Tight junction | Sepsis disrupts intestinal barrier which leads to a multiple organ dysfunction syndrome and alters the expression of tight junction proteins | [ |
Figure 3The model predicting mortality/survival of sepsis (grouped as sepsis/CAP and other sepsis) patients. (A) Boxplots of the model scores predicting mortality/survivorship in the discovery (left) and validation (right) cohorts. The model was trained on the same data as the published discovery cohort by the gradient boosted regression tree and validated on an independent cohort[8]. Dashed lines indicate threshold levels of classification. The threshold was calculated by maximizing a product of the specificity and sensitivity of the model prediction in the discovery cohort. Further details of model accuracy are given in Tables 2 and S2. (B) Receiver operating characteristic curves (ROC) for the model predicting mortality (endpoint—survival or death within 28 days after treatment) in sepsis (grouped as sepsis/CAP and other sepsis) patients (blue line—discovery cohort, red line—validation cohort). Features were selected by the t-test comparing gene ensemble noise between the survived and deceased patients in the discovery cohort to achieve maximum prediction accuracy for the discovery cohort. Values for the area under the ROC curve (AUC) are indicated. (C) Survival probability for the patients predicted to have low (blue line) and high (green line) risk of mortality for the discovery (left panel) and validation (right panel) cohorts. P-values indicate significant differences in hazards for the predicted classes (survival/mortality) according to the Cox proportional-hazards model. Black lines—survival probability of patients with Mars1 endotype[8] was compared with the predicted deceased class for the discovery and validation cohorts. (D) Variable importance of the model ranks gene ensemble noise features according to their relative contribution (gain).
Prediction accuracy of the models for the sepsis and COVID-19 patients based on the gene ensemble noise and WGCNA eigengenes explanatory variables for the discovery (Disc.) and validation (Valid.) cohorts.
| Metric | Sepsis | Sepsis/CAP | Other sepsis | COVID-19 | ||||
|---|---|---|---|---|---|---|---|---|
| Disc | Valid | Disc | Valid | Disc | Valid | Disc | Valid | |
| bACC | 0.799 | 0.701 | 0.802 | 0.798 | 0.779 | 0.761 | 0.963 | 0.95 |
| Sensitivity | 0.754 | 0.6 | 0.88 | 0.867 | 0.75 | 0.8 | 0.974 | 1 |
| Specificity | 0.845 | 0.801 | 0.725 | 0.73 | 0.807 | 0.722 | 0.951 | 0.9 |
| bACC | 0.805 | 0.707 | 0.8 | 0.806 | 0.82 | 0.734 | 0.963 | 0.9 |
| Sensitivity | 0.799 | 0.614 | 0.8 | 0.867 | 0.798 | 0.769 | 0.974 | 0.9 |
| Specificity | 0.812 | 0.8 | 0.8 | 0.746 | 0.841 | 0.7 | 0.951 | 0.9 |
Figure 4The model predicting mortality/survival of sepsis/CAP patients. (A) Boxplots of the model scores predicting mortality/survivorship in the discovery (left) and validation (right) cohorts. (B) ROC curves for the model predicting mortality in sepsis/CAP patients in the discovery (blue line) and validation (red line) cohorts. Cohorts were partitioned as in the previous study[8]. (C) Survival probability for the patients predicted to have low (blue line) and high (green line) risk of mortality for the discovery (left panel) and validation (right panel) cohorts. (D) Relative contribution of gene ensemble noise features to the model.
Figure 5The model predicting mortality/survival of other sepsis patients. (A) Boxplots of the model scores. (B) ROC curves for the model predicting mortality in sepsis patients. Cohorts were partitioned as in the previous study[8]. (C) Survival probability for the patients predicted to have low (blue line) and high (green line) risk of mortality. (D) Relative contribution of gene ensemble noise features to the model.
Figure 6Association of gene ensemble noise with the COVID-19 disease state. (A) Inter-individual biological variability in leukocyte gene expression (bcv—biological coefficient of variation) increases in severe COVID-19 patients as compared to the mild ones. Left panel—average estimates of the bcv for all genes expressed in the patients’ leukocytes, right panel—bcv for the genes with significant changes in inter-individual biological variability (false discovery rate, FDR < 0.05). p-values of t-tests comparing differences in inter-individual gene expression variability for mild and severe COVID-19 patients. Circles and whiskers indicate means and standard deviations respectively. (B) Plots of gene ensemble noise for genes encoding subunits of mitochondrial respiratory chain complex I (subcomplex I alpha—left panel and nuclear-encoded subunits—right panel). Black circles and whiskers indicate means and standard deviations. t and p-values of the tests comparing gene ensemble noise for mild and severe COVID-19 patients are shown. (C) ROC curves for the model based on the gene ensemble noise for the discovery (blue line) and validation (red line) cohorts, and all samples (black line). For further details on the model accuracy see Tables 2 and S4. (D) Relative contribution of gene ensemble noise features to the model. For further details see Table S5.