| Literature DB >> 33564792 |
Phillip H Comella, Edgar Gonzalez-Kozlova, Roman Kosoy, Alexander W Charney, Irene Font Peradejordi, Shreya Chandrasekar, Scott R Tyler, Wenhui Wang, Bojan Losic, Jun Zhu, Gabriel E Hoffman, Seunghee Kim-Schulze, Jingjing Qi, Manishkumar Patel, Andrew Kasarskis, Mayte Suarez-Farinas, Zeynep H Gümüş, Carmen Argmann, Miriam Merad, Christian Becker, Noam D Beckmann, Eric E Schadt.
Abstract
The molecular mechanisms of chronic fatigue syndrome (CFS, or Myalgic encephalomyelitis), a disease defined by extreme, long-term fatigue, remain largely uncharacterized, and presently no molecular diagnostic test and no specific treatments exist to diagnose and treat CFS patients. While CFS has historically had an estimated prevalence of 0.1-0.5% [1], concerns of a "long hauler" version of Coronavirus disease 2019 (COVID-19) that symptomatically overlaps CFS to a significant degree (Supplemental Table-1) and appears to occur in 10% of COVID-19 patients[2], has raised concerns of a larger spike in CFS [3]. Here, we established molecular signatures of CFS and a corresponding network-based disease context from RNA-sequencing data generated on whole blood and FACs sorted specific peripheral blood mononuclear cells (PBMCs) isolated from CFS cases and non-CFS controls. The immune cell type specific molecular signatures of CFS we identified, overlapped molecular signatures from other fatiguing illnesses, demonstrating a common molecular etiology. Further, after constructing a probabilistic causal model of the CFS gene expression data, we identified master regulator genes modulating network states associated with CFS, suggesting potential therapeutic targets for CFS.Entities:
Year: 2021 PMID: 33564792 PMCID: PMC7872387 DOI: 10.1101/2021.01.29.21250755
Source DB: PubMed Journal: medRxiv
Fig. 1:Study Analysis Workflow.
A: RNA-seq read count data were generated on whole blood and FACs-sorted immune cell samples from CFS cases and controls. B: RNA-seq count data were passed through a viral-clonal detection pipeline. C: RNA-seq count data were passed through our MLFS and DE pipelines, generating predictive signatures of disease. D: Co-expression network construction organized genes into modules, which were annotated for biological pathways and other disease signatures. G: A union of modules with enrichment for CFS and CFS signatures were used with whole blood gene expression in an independent cohort to build a regulatory network where key drivers of disease were identified.
Fig. 2:Viral and Clonal analysis detects dysregulation in CFS.
A: Principal Component Analysis (PCA) of viral load estimated from the whole blood RNAseq data between patients and controls. B: Wilcoxon rank test of the viral mapping mean between patients and controls. C: Kolmogorov–Smirnov distribution test of the viral mapping mean between patients and controls. D: Clonal read support of T and B Cell clones between patients and controls.
Fig. 3:Machine Learning Feature Selection (MLFS) identifies predictive signatures of CFS.
A: Classifier models built from CFS signatures show predictive ROC AUC performance on hold-out test sets across the different cell types (y-axis and color). B: DE analysis for CFS vs HC. X-axis represents the number of DE genes; bars are colored by direction of expression. Y-axis represents the different p value cutoff <0.05 of either Nominal or FDR adjusted p-value. C: CFS signatures were established using MLFS for significance and DE for expression directionality. Signatures are colored per cell type. D: Jaccard index showing signature similarity. E. Top GO enrichment table for signatures, colored per cell type.
Fig. 4:Co-expression network analysis identifies modules of genes dysregulated in CFS and other disease signatures.
A: Table of all co-expression modules significantly associated with CFS signature, colored per cell type. X-axis represents either −log(FDR pvalue) or Odds Ratio (OR) of enrichment for CFS signature. The most significantly associated modules are highlighted with red boxes. B: Enrichment heatmap of top CFS modules and literature signatures from other disease. Y-axis represents disease signatures and are colored and grouped by the category of disease the signature falls into. C: Top 3 functional annotations of top CFS modules. This figure only shows those top modules with significant functional annotations.
Fig. 5:Bayesian Network and Key Driver Analysis identifies regulators of CFS.
A: A 2D representation of the Bayesian network. B and C illustrate key drivers of the network. B: shows the frequency in which a gene is considered a KD. C: shows the DE logFC of the KD gene in the cell specific signatures.