Literature DB >> 32859762

A pathogenic and clonally expanded B cell transcriptome in active multiple sclerosis.

Akshaya Ramesh1,2, Ryan D Schubert1,2, Ariele L Greenfield1,2, Ravi Dandekar1,2, Rita Loudermilk1,2, Joseph J Sabatino1,2, Matthew T Koelzer3, Edwina B Tran1,2, Kanishka Koshal1,2, Kicheol Kim1,2, Anne-Katrin Pröbstel1,2, Debarko Banerji1,2, Chu-Yueh Guo1,2, Ari J Green1,2, Riley M Bove1,2, Joseph L DeRisi4,5, Jeffrey M Gelfand1,2, Bruce A C Cree1,2, Scott S Zamvil1,2,6, Sergio E Baranzini1,2, Stephen L Hauser1,2, Michael R Wilson7,2.   

Abstract

Central nervous system B cells have several potential roles in multiple sclerosis (MS): secretors of proinflammatory cytokines and chemokines, presenters of autoantigens to T cells, producers of pathogenic antibodies, and reservoirs for viruses that trigger demyelination. To interrogate these roles, single-cell RNA sequencing (scRNA-Seq) was performed on paired cerebrospinal fluid (CSF) and blood from subjects with relapsing-remitting MS (RRMS; n = 12), other neurologic diseases (ONDs; n = 1), and healthy controls (HCs; n = 3). Single-cell immunoglobulin sequencing (scIg-Seq) was performed on a subset of these subjects and additional RRMS (n = 4), clinically isolated syndrome (n = 2), and OND (n = 2) subjects. Further, paired CSF and blood B cell subsets (RRMS; n = 7) were isolated using fluorescence activated cell sorting for bulk RNA sequencing (RNA-Seq). Independent analyses across technologies demonstrated that nuclear factor kappa B (NF-κB) and cholesterol biosynthesis pathways were activated, and specific cytokine and chemokine receptors were up-regulated in CSF memory B cells. Further, SMAD/TGF-β1 signaling was down-regulated in CSF plasmablasts/plasma cells. Clonally expanded, somatically hypermutated IgM+ and IgG1+ CSF B cells were associated with inflammation, blood-brain barrier breakdown, and intrathecal Ig synthesis. While we identified memory B cells and plasmablast/plasma cells with highly similar Ig heavy-chain sequences across MS subjects, similarities were also identified with ONDs and HCs. No viral transcripts, including from Epstein-Barr virus, were detected. Our findings support the hypothesis that in MS, CSF B cells are driven to an inflammatory and clonally expanded memory and plasmablast/plasma cell phenotype.
Copyright © 2020 the Author(s). Published by PNAS.

Entities:  

Keywords:  B cell; immune repertoire; multiple sclerosis; neuroimmunology

Mesh:

Substances:

Year:  2020        PMID: 32859762      PMCID: PMC7502747          DOI: 10.1073/pnas.2008523117

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


Multiple sclerosis (MS) is a common autoimmune demyelinating disease of the central nervous system (CNS), affecting ∼1 million people in the United States (1). Although T cells are important effector cells in MS, it is now clear that B cells play a central role in both the relapsing and progressive forms of the disease (2–5). To date, microarray and bulk RNA-sequencing (RNA-Seq) studies of B cells from MS subjects have been carried out on CNS and blood samples with a focus on understanding differential expression of B cell receptor (BCR) genes in MS compared with healthy controls (HCs) (6–12). These studies have not yet been able to clearly define the transcriptome-wide profiles of CNS B cell subpopulations or compare them with their peripheral counterparts. More effective therapies against MS, especially against progressive disease, will likely require the targeting of residual CNS B cells, a heterogeneous population that may include culprit autoreactive clones as well as beneficial regulatory B cells that serve homeostatic functions. Thus, better clarifying the functional phenotypes of CNS B cell subtypes in MS may not only shed light on disease pathogenesis but also potentially provide more disease-specific and safer therapeutic targets to guide development of the next-generation of B cell therapeutics. Similar to a recent study (13), we performed RNA-Seq at single-cell resolution of paired cerebrospinal fluid (CSF) and blood samples from relapsing-remitting MS (RRMS), other neurologic diseases (ONDs), and HCs. Additionally, we paired single-cell transcriptome data with immunoglobulin repertoire sequencing (Ig-Seq) of B cells in MS so that transcriptomic phenotypes of B cells could be further delineated based on both Ig subclass as well as the degree to which a cell is clonally expanded. While this methodology makes it possible to simultaneously obtain transcriptional phenotypes and paired Ig heavy- and light-chain sequences from a single cell, the number of genes whose messenger RNA (mRNA) transcripts can be reliably detected in each cell remains relatively small with today’s single-cell technology (∼1,000 genes). Thus, to extend our findings to include more in-depth transcriptional phenotyping, we performed bulk RNA-Seq on five classical B cell subsets defined by CD19, CD27, and IgD expression from the CSF and blood on an independent cohort of seven treatment-naïve, RRMS subjects. We further leveraged our bulk RNA-Seq dataset to assess nonhost and human endogenous retrovirus (HERV) transcripts to look for evidence of viral transcription in both blood and CSF B cells.

Materials and Methods

Single-Cell RNA-Seq Cohort.

Study cohort and inclusion criteria.

All RRMS and clinically isolated syndrome (CIS) subjects were participants in either the University of California, San Francisco (UCSF) Origins or Expression, Proteomics, Imaging, Clinical (EPIC) studies (14, 15) and were diagnosed according to the 2017 McDonald criteria (16). Subjects were not on immunomodulatory or immunosuppressive disease-modifying therapy at the time of sample collection. OND and HC participants were enrolled in a biobanking study “Immunological Studies of Neurologic Subjects” (Dataset S1).

CSF and blood processing and single-cell sequencing.

CSF and blood were obtained during diagnostic and research procedures after subjects had provided informed consent; 20 to 30 mL of CSF was centrifuged immediately after collection at 400 × g for 15 min at 4 °C. The pellet was resuspended in ∼80 µL of residual supernatant, and lymphocytes were counted using a hemocytometer. Peripheral blood mononuclear cells (PBMCs) were isolated using cell preparation tubes as described previously (17) and resuspended in 2% fetal bovine serum. All experiments were performed immediately with freshly collected, unsorted cells. To obtain additional single-cell Ig-Seq (scIg-Seq) data for some patients, CSF or PBMCs were enriched for total B cells or for memory B cells via bead-based selection (EasySep Human Pan-B cell enrichment kit and EasySep Human Memory B cell Isolation Kit, respectively; StemCell Technologies). Sequencing libraries were prepared using 3′ or 5′ library preparation kits (10x Genomics).

Bioinformatic analysis.

Both single-cell RNA-Seq (scRNA-Seq) and scIg-Seq datasets were analyzed using the Cell Ranger (v3.0.1) count and variable diversity joining (VDJ) functions using the Ensembl GRCh38.v93 and GRCh38.v94 references, respectively. Data were analyzed using a custom bioinformatics pipeline that included Seurat (v3.1.2), the Spliced Transcripts Alignment to a Reference (STAR) algorithm (v2.5.1), SingleR (v1.1.7), and DoubletFinder (v2.0.2). The Immcantation pipeline, specifically IgBLAST (v1.4.0) and Change-O (v0.4.6), was used to assemble heavy- and light-chain contigs for each cell.

Quality control for barcode hopping.

Using a custom pipeline, overlapping reads present in more than one sample that shared the same cell barcode and unique molecular identifier (UMI) were counted and filtered using a single sample read assignment threshold percentage of 80% as previously described (18). If no sample accounted for at least 80% of the overlapping reads, the reads were removed and excluded from further analysis from both samples (code available at https://github.com/UCSF-Wilson-Lab/SingleCellVDJdecontamination).

Quality control for RNA-Seq data.

The 3′ and 5′ single-cell datasets were analyzed separately. All gene counts were aggregated together using Seurat. Only genes present in two or more cells were included. Only cells containing transcripts for 700 or more genes were included. For both the 5′ and 3′ data, maximum cutoffs of 2,500 and 4,000 genes were used, respectively. The PercentageFeatureSet function was used to calculate the percentage of mitochondrial transcript expression for each cell. Cells were omitted if they expressed at least 10% mitochondrial genes or were defined by the platelet markers PPBP and PF4. Gene counts were normalized using the R package SCTransform (19). In addition, nonregularized linear regression was used on the UMI counts per cell. Both datasets were clustered using 20 principal components in Seurat. Clusters were formed using a shared nearest neighbor graph in combination with dimensional reduction using uniform manifold approximation and projection (UMAP) (20). In order to optimize the total number of principal components, cumulative sums were iteratively calculated for each principal component to measure the percent variance accounted for with the data. A threshold of 90% variance was applied. As a result, 10 and 17 principal components were used for the 5′ and 3′ data, respectively. Both datasets were reclustered with the newly calculated principal components. Doublet detection and removal were performed using DoubletFinder (21) with expected doublet rates set based upon the 10x Genomics reference manual.

Cell type annotation and differential gene expression analysis.

Immune cell type identity was defined by performing differential gene expression (DGE) analysis for each cluster. The normalized gene expression profile for each cluster was compared with the remaining cells using a Wilcoxon rank sum test as provided in Seurat’s FindAllMarkers function (default parameters). The most up-regulated genes, with the highest positive average log fold change, were compared with a custom panel of canonical gene makers (Dataset S2) spanning several key immune cell types, including B cells, CD4+ T cells, CD8+ T cells, natural killer (NK) cells, classical monocytes, inflammatory monocytes, macrophage, plasmacytoid dendritic cells, and monocyte-derived dendritic cells. Clusters that predominantly expressed PPBP or HEMGN genes were omitted. In addition, to improve specificity, all B cells included in our analysis were required to have reads to CD79B. All clusters annotated as B cells were extracted and annotated further with SingleR using the combined Blueprint and ENCODE reference dataset (22–24). DGE analysis for heat map generation was performed using the FindMarkers command in Seurat with the Wilcoxon test and the following parameters: P-adjusted value cutoff = 0.05 and logFC cutoff = 0.25. DGE was performed using zinbwave (v1.8.0) (default parameters) (25) and DESeq2 (v1.26.0) (Dataset S3). For the DGE and subsequent downstream analysis, only paired CSF and blood transcriptome profiles of RRMS patients were included.

Pathway analysis.

Pathway analysis was carried out on both the 5′ and 3′ scRNA-Seq datasets using the Ingenuity Pathway Analysis (IPA) toolkit (v01.12) (26) (Dataset S4). We reported all pathways that were significant at Benjamini–Hochberg P adjusted of 0.05. In addition, upstream regulator analysis and causal network analysis were carried out using the IPA toolkit to identify likely upstream regulators of genes in the dataset. The activation state of a pathway or gene was calculated based on differentially expressed genes (P adjusted = 0.05), and all pathways/causal networks/master upstream regulators with a P adjusted less than 0.05 and Z score greater than or equal to absolute value of two were reported. Upstream and master regulators with a Z score ≥2 were considered activated and ≤−2 were considered inhibited. The ToppGene suite of tools and clusterProfiler (27) were used to identify Gene Ontology (GO) terms (biological and molecular processes) enriched in the differentially expressed genes. Only the overlapping statistically significant differentially expressed genes in the 5′ and 3′ scRNA-Seq datasets were used in ToppGene and clusterProfiler.

Identification of clonally related B cells.

Assembled contigs outputted from Cell Ranger were inputted into the Immcantation pipeline for a second round of alignment to the VDJ region using IgBLAST. Contigs containing fewer than three UMIs were omitted. Only contigs that aligned in frame (both the FUNCTIONAL and IN_FRAME output fields were TRUE) and across the constant region were retained. B cells were clustered based on similarities between their IGHV (immunoglobulin heavy-chain variable region gene), IGHJ (immunoglobulin heavy-chain joining region gene), and H-CDR3 (heavy-chain Complementary Determining Region 3) amino acid sequence. The Change-O toolkit generated a length-normalized bimodal distance-to-nearest distribution of hamming distances for all cells in the dataset. A Gaussian mixture model was applied to predict an appropriate hamming distance threshold (threshold = 0.15). B cells were defined as highly similar if they shared identical V and J genes as well as a CDR3 amino acid sequence within the accepted threshold. Any cell containing a single chain was omitted. Within an individual, B cells that are highly similar based on these criteria are likely clonally related (i.e., originating from the same progenitor B cell), but that is less likely when comparing B cells with highly similar heavy-chain sequences between individuals. Clonal expansion within an individual was defined as a highly similar B cell cluster containing two or more B cells (28–30).

B Cell Subset Bulk RNA-Seq Cohort.

All subjects were participants in either the Origins or EPIC studies at UCSF. Similar to patients included in the single-cell sequencing cohort, subjects had to be treatment-naïve at the time of sample collection with a clinical diagnosis of MS according to the 2017 McDonald criteria (16) (Dataset S1). Further, each subject had to have at least 7 of 10 B cell subsets (5 CSF and 5 blood subsets total) with 50 or more cells sorted by flow cytometry.

Sample collection and flow cytometry analysis.

CSF and blood were obtained during diagnostic and research procedures following informed consent. Fresh whole CSF and blood lymphocytes were labeled with the following antibodies: CD3, CD19, CD27, CD38, CD138, and IgD. Using a Beckman MoFlo Astrios cell sorter, CSF and blood B cell subsets were sorted directly into lysis buffer as previously described (17, 31): naïve B cells (N; CD19+IgD+CD27–), unswitched memory B cells (USM; CD19+IgD+CD27+), switched memory B cells (SM; CD19+IgD–CD27+), double-negative B cells (DN; CD19+IgD–CD27–), CSF plasmablast/plasma cells (CD19+IgD–CD27hi), and blood plasmablast/plasma cells (CD19+IgD–CD27hiCD38hi) (Dataset S1). These distinct B cell populations represent classic B cell subsets, as described previously (17, 32–34). Flow sorted cell lysates were stored at −80 °C until RNA extraction was performed.

Complementary DNA (cDNA) library preparation and next-generation sequencing.

RNA was extracted from the CSF and blood B cell subsets using the AllPrep DNA/RNA Micro kit (Qiagen), automated on the QIAcube robotics platform. RNA-Seq libraries were prepared with the Next Ultra II RNA Library Prep kit (New England Biolabs), automated on a Beckman-Coulter Biomek NXP robot. Individual cDNA libraries were barcoded with dual-indexing barcodes, pooled evenly, and sequenced on an Illumina HiSeq 4000 machine to generate 150-base pair paired-end reads.

Bioinformatic Analyses.

DGE and pathway analysis.

The bulk RNA-Seq cohort was analyzed independently of the scRNA-Seq cohort. Quality filtering of raw sequencing files was performed using the Paired-Read Iterative Contig Extension software tool kit (v1.2) (80% of sequences with a Q score > 0.98) (35). High-quality reads were aligned to the human genome (hg38) and the Genome-based Endogenous Viral Element database (36) using STAR (37). Final gene counts for the human genome (including HERV transcripts) were obtained using STAR (v2.5.3a) and samtools (v1.7), respectively (36–38). DGE and principal component analysis was conducted using DESeq2 (v2.1.14.1) using counts of all protein-coding genes expressed in greater than 10% of the samples (39) across all CSF and blood B cell subsets (Dataset S3). A false discovery rate and P-adjusted (Benjamini–Hochberg) cutoff of 0.05 were used to select the final list of differentially expressed genes. Cook’s distance, used for identification of outliers, was calculated in DESeq2 (v2.1.14.1). Genes containing an outlying count as defined by a Cook’s distance above the default threshold were flagged and not included in downstream analysis. DGE analysis was focused on memory B cells and plasmablast/plasma cells, with additional B cell subsets described in the . Pathway analysis was carried out using the IPA toolkit, with methods described in the single-cell cohort section. In and , we highlight pathways and GO terms that were statistically significant by at least one of the single-cell (3′ or 5′) and bulk RNA-Seq methods.

Immune repertoire analysis.

A custom bioinformatics pipeline incorporating MiXCR (v2.1.9) adapted for RNA-Seq data were used to identify Ig heavy-chain (IGH) and light-chain (IGK, IGL) germline genes from raw, paired-end sequencing reads (40). BCRs using identical IGHV and IGHJ germline segments and identical or near-identical H-CDR3 regions (minimum eight-amino acid length, maximum Hamming distance of two at the amino acid level) were considered highly similar (17, 41). Each candidate BCR sequence had to have a coverage depth of two or more across the entire length of the sequence to be included in the analysis.

Virus identification.

Viral transcripts were identified from raw sequencing reads using IDseq (v2.11) as previously described (42–44). To distinguish potential viruses from laboratory reagent and skin commensal flora contaminants, a Z score was calculated for the value observed for each genus relative to water (“no template”) and uninfected CSF controls (45).

Results

Overview of Study.

The structure of this study is outlined in Fig. 1. Across the scRNA-Seq and scIg-Seq cohorts, 24 total subjects were enrolled (n = 16 RRMS, 3 OND, 3 HC, 2 CIS) (Table 1 and Dataset S1). For gene expression analysis using scRNA-Seq, 12 RRMS (n = 10 from 5′ and n = 2 from 3′), 1 OND, and 3 HC were analyzed. For paired scIg-Seq, a subset of the scRNA-Seq gene expression cohort (seven RRMS, one OND, three HC) and two additional RRMS subjects were included. For analysis of B cell clonal connections, all available subjects, including those from whom matched unsorted CSF and blood samples were not available, were included (n = 14 RRMS, n = 3 OND, n = 2 CIS, and n = 3 HC). Sequencing libraries were prepared using the 10x Genomics 3′ and 5′ library preparation kits (detailed in Fig. 1 and Dataset S1).
Fig. 1.

Study overview and the CSF immune landscape in HCs and RRMS subjects. (A) A schematic representation and overview of sample collection, processing, and bioinformatic analysis. (B) Representative violin plots of paired CSF and blood from 5′ scRNA-Seq (n = 10 RRMS). The expression of canonical genes is shown for each of the Seurat clusters generated, with manual cell type annotations inputted for each Seurat cluster. (C) The 5′ gene expression from the samples represented in B with UMAP coordinates, and in D, heat maps for CSF (Upper) and blood (Lower) are shown. The top five genes exhibiting the strongest differential expression for each of the major lineages are displayed. The following scaling factors were chosen to maintain the overall proportion of cell types found in the CSF and blood: *Cell type down sampled by 66.67% to enable visualization of low-proportion cell types; **Cell type down sampled by 90% to enable visualization of low-proportion cell types. (E) Volcano plot of difference in proportion of each major cell type in the CSF compared with the blood with false discovery Q value of 0.05 for multiple t tests represented as a dashed line on a log scale. (F) Similar volcano plot of difference in proportion of each major cell type in RRMS vs. HC.

Table 1.

Subject demographics

DiagnosisAge (y)Female (%)EDSSCSF WBCOCBs (% positive)IgG indexGd (% positive)Months from onset
RRMS (n = 7, bulk)25–40711.5–2.51–20860.66–1.4431–79
RRMS (n = 16, single cell)22–54690–40–13810.46–1.75630–166
CIS (n = 2, single cell)35–531002.5–42–1100.47–0.51501–3
OND* (n = 3, single cell)21–72100N/A0–41000.52–0.7005–90
HC (n = 3, single cell)27–4133N/A0–100.51–0.58N/AN/A

Seven treatment-naïve RRMS subjects whose age at enrollment ranged from 23 to 40 were included in the bulk RNA-Seq cohort. Twenty-four subjects (n = 16 RRMS, n = 2 CIS, n = 3 OND, n = 3 HC) (Fig. 1 and Dataset S1 have a breakdown of samples included in the RNA-Seq and Ig-Seq analyses) whose age at enrollment ranged from 21 to 72 were included in the single-cell cohort. OCB, oligoclonal band; EDSS, expanded disability status scale; WBC, white blood cell; N/A, not applicable; Gd, gadolinium.

Subjects in the OND group had diagnoses of neurosarcoidosis, atypical neuroinflammatory syndrome with uveitis, and AQP4+ neuromyelitis optica.

Study overview and the CSF immune landscape in HCs and RRMS subjects. (A) A schematic representation and overview of sample collection, processing, and bioinformatic analysis. (B) Representative violin plots of paired CSF and blood from 5′ scRNA-Seq (n = 10 RRMS). The expression of canonical genes is shown for each of the Seurat clusters generated, with manual cell type annotations inputted for each Seurat cluster. (C) The 5′ gene expression from the samples represented in B with UMAP coordinates, and in D, heat maps for CSF (Upper) and blood (Lower) are shown. The top five genes exhibiting the strongest differential expression for each of the major lineages are displayed. The following scaling factors were chosen to maintain the overall proportion of cell types found in the CSF and blood: *Cell type down sampled by 66.67% to enable visualization of low-proportion cell types; **Cell type down sampled by 90% to enable visualization of low-proportion cell types. (E) Volcano plot of difference in proportion of each major cell type in the CSF compared with the blood with false discovery Q value of 0.05 for multiple t tests represented as a dashed line on a log scale. (F) Similar volcano plot of difference in proportion of each major cell type in RRMS vs. HC. Subject demographics Seven treatment-naïve RRMS subjects whose age at enrollment ranged from 23 to 40 were included in the bulk RNA-Seq cohort. Twenty-four subjects (n = 16 RRMS, n = 2 CIS, n = 3 OND, n = 3 HC) (Fig. 1 and Dataset S1 have a breakdown of samples included in the RNA-Seq and Ig-Seq analyses) whose age at enrollment ranged from 21 to 72 were included in the single-cell cohort. OCB, oligoclonal band; EDSS, expanded disability status scale; WBC, white blood cell; N/A, not applicable; Gd, gadolinium. Subjects in the OND group had diagnoses of neurosarcoidosis, atypical neuroinflammatory syndrome with uveitis, and AQP4+ neuromyelitis optica. Seven treatment-naïve RRMS subjects were enrolled in the bulk RNA-Seq cohort (Table 1 and Dataset S1). Flow cytometric B cell subset distributions for the CSF and blood were determined for all seven subjects in the bulk RNA-Seq cohort (Dataset S1). Principal component analysis demonstrated that B cells primarily clustered based on their subset type and the body compartment from which they were isolated ().

Distinct T Cells and Myeloid Cells Define the Cerebrospinal Fluid Immune Environment, with B Cells Abnormally Enriched in RRMS Compared with HCs.

We analyzed 5′ scRNA-Seq libraries from paired blood and CSF from 10 RRMS subjects. Expression of canonical immune cell markers was determined for each unsupervised Seurat cluster (Fig. 1). To determine the relationship between cells, we plotted the data using UMAP coordinates and revealed distinct cell types (Fig. 1 and ). We found distinct differences in the myeloid and lymphoid populations in the CSF relative to blood of MS patients, consistent with previous reports (13, 46, 47). CSF macrophages, also termed microglia-like or CSF microglial cells, were present in both RRMS and HCs (). In patients with RRMS, comparison of CSF cell types relative to blood found no cell type with a Q value < 0.05 (Fig. 1). In comparison with HCs, CSF B cells were 13 times more abundant in RRMS subjects (Fig. 1 and ).

Integrated Bulk and Single-Cell Transcriptomic Analysis of B Cells in RRMS Identifies Distinct Changes in CSF Memory Cells and Plasmablast/Plasma Cells.

We were able to clearly identify three B cell subtypes in the 5′ and 3′ scRNA-Seq datasets: naïve, memory, and plasmablast/plasma cells (Fig. 2), with naïve and memory B cells clustered in a transcriptional continuum in UMAP space (Fig. 2). In both the bulk (flow cytometry) and single-cell cohorts, CSF contained higher proportions of memory B cells and plasmablast/plasma cells than in peripheral blood (Fig. 2). Expressions of classic B cell markers were similar across bulk and scRNA-Seq ().
Fig. 2.

Single-cell and bulk profiling of B cells in RRMS. (A) Heat map of B cells from paired RRMS CSF and blood samples from the 5′ scRNA-Seq gene expression analysis (n = 10 subjects) revealing clustering into three main groups of B cells. (B) Representation of this dataset in UMAP space extracted from and zoomed in on from the UMAP coordinates represented in Fig. 1. Annotations generated algorithmically using SingleR with the Encode reference set. (C) Flow cytometry B cell results from 7 paired RRMS blood and CSF samples for bulk RNA-Seq and 12 paired RRMS blood and CSF samples (n = 10 from 5′ and n = 2 from 3′ scRNA-Seq) from scRNA-Seq. Statistical tests were performed using the Mann–Whitney U test; mean ± SEM is shown. (D) Network diagram of CSF switched memory cells with a focus on the cholesterol biosynthesis pathways (Upper) and corresponding heat map of differentially expressed genes (Lower) in the bulk RNA-Seq cohort. Pathway data were generated using clusterProfiler. Genes are depicted as outer nodes in shades of red (corresponding to fold enrichment), while GO terms are highlighted centrally in yellow. (E) Schematic representation of upstream NF-κB pathway members predicted to be activated (Upper), and heat map of differentially expressed downstream NF-κB pathway members genes (Lower) in CSF SM cells from the bulk RNA-Seq cohort. (F) Normalized expression of genes involved in inflammatory pathways in CSF and blood plasmablast/plasma cells. PB/PC, plasmablast/plasma cells. *P < 0.05, **P < 0.01, ***P < 0.001.

Single-cell and bulk profiling of B cells in RRMS. (A) Heat map of B cells from paired RRMS CSF and blood samples from the 5′ scRNA-Seq gene expression analysis (n = 10 subjects) revealing clustering into three main groups of B cells. (B) Representation of this dataset in UMAP space extracted from and zoomed in on from the UMAP coordinates represented in Fig. 1. Annotations generated algorithmically using SingleR with the Encode reference set. (C) Flow cytometry B cell results from 7 paired RRMS blood and CSF samples for bulk RNA-Seq and 12 paired RRMS blood and CSF samples (n = 10 from 5′ and n = 2 from 3′ scRNA-Seq) from scRNA-Seq. Statistical tests were performed using the Mann–Whitney U test; mean ± SEM is shown. (D) Network diagram of CSF switched memory cells with a focus on the cholesterol biosynthesis pathways (Upper) and corresponding heat map of differentially expressed genes (Lower) in the bulk RNA-Seq cohort. Pathway data were generated using clusterProfiler. Genes are depicted as outer nodes in shades of red (corresponding to fold enrichment), while GO terms are highlighted centrally in yellow. (E) Schematic representation of upstream NF-κB pathway members predicted to be activated (Upper), and heat map of differentially expressed downstream NF-κB pathway members genes (Lower) in CSF SM cells from the bulk RNA-Seq cohort. (F) Normalized expression of genes involved in inflammatory pathways in CSF and blood plasmablast/plasma cells. PB/PC, plasmablast/plasma cells. *P < 0.05, **P < 0.01, ***P < 0.001.

CSF Antigen-Experienced Memory B Cells Activate Nuclear Factor kappa-Light-Chain-Enhancer of Activated B Cells (NF-κB) and Proinflammatory Cytokines.

IgD–CD27+ SM.

A total of 294 genes were differentially expressed between CSF and blood SM B cells, with 166 of these up-regulated in the CSF (Dataset S3). Compared with their peripheral counterparts, CSF SM B cells up-regulated CD138 and CD83 and down-regulated CD52, CD180, CD53, CD22, and CD48 (Table 2 and Dataset S3). Additionally, IL10RA, IL2RB, TGFB1, TBX21, CXCR3, CXCR4, CCR2, and CCR5 were up-regulated by CSF SM B cells. A total of 18 upstream regulators were significant, including the transmembrane receptor TREM1 (Z = 2.2) and the NF-κB (Z = 2.7) complex, both predicted to be activated (Fig. 2 and Dataset S4). Additionally, CEBPB, a transcriptional regulator downstream of the NF-κB pathway, was also predicted to be activated (Z = 2.2), implying activation of the canonical NF-κB pathway. Among chemokines, both CCL2 and CCL11 were predicted to be activated (Dataset S4). By IPA, 59 networks were significant, 50 of which were predicted to be activated (Dataset S4). CSF SM B cells showed enrichment of GO terms corresponding to an inflammatory profile, not observed in their peripheral counterparts (Dataset S4). Similarly, clusterProfiler analysis also revealed an enrichment of GO terms related to cholesterol metabolism and biosynthesis (Fig. 2) (Dataset S4). An enrichment of GO terms corresponding to transcription, translation, and catabolic processes was also seen in the blood SM B cells (Dataset S4).
Table 2.

RNA-Seq analysis findings in B cell subsets

B cell subsetCell surface transcripts up in CSFCell surface transcripts down in CSFTranscripts up in CSFSignificant upstream regulatorsSignificant networks
NCD82CD22, CD52, CD74IL10RA, CXCR4, RASGRP2, IL2RBIL3, TLR9
DNCD3E, CD5CD200TPSD1, USP21
USMCD83CD52, CD180, CD27IL10RA, CXCR4, TGFB1NF-κB, TREM-1IL3, IL6, IL7, IL32, OSM, TSLP, CXCL8
SMCD138, CD83CD52, CD180, CD53, CD22, CD48IL10RA, CXCR3, TBX21, IL2RB, CXCR4, CCR5, CCR2, AKNA, TGFB1NF-κB, CCL2, CCL11
Plasmablast/plasma cellsCD138, CD74, CD79A, CD46, BCRAKNA, CCR5, IL21R, UBA7, CXCR4, TBX21, IL16, SOX5, CXCR3, PPARAIL4*

Curated overview of transcripts up- and down-regulated between CSF and blood, significant causal networks, and upstream regulators predicted to be activated/inhibited using the IPA toolkit in five B cell subsets (N, DN, USM, SM, and plasmablasts/plasma cells).

Pathway inhibited.

RNA-Seq analysis findings in B cell subsets Curated overview of transcripts up- and down-regulated between CSF and blood, significant causal networks, and upstream regulators predicted to be activated/inhibited using the IPA toolkit in five B cell subsets (N, DN, USM, SM, and plasmablasts/plasma cells). Pathway inhibited.

IgD+CD27+ USM.

A total of 228 genes were differentially expressed between CSF and blood USM B cells; 96 were up-regulated in the CSF (Dataset S3). Compared with their blood counterparts, CSF USM B cells up-regulated CD83, IL10RA, CXCR4, and TGFB1 and down-regulated CD180, CD52, and CD27 (Table 2 and Dataset S3). IPA network analysis predicted that five upstream regulators were activated including TREM-1 (Z = 2.5) and NF-κB (Z = 2.6) (Dataset S4). Thirty-four master regulators were significant, of which 29 were predicted to be activated, including the following cytokines: IL3, IL6, IL7, IL32, OSM, TSLP, and CXCL8 (Dataset S4). Only one GO term was enriched in CSF USM B cells: “skeletal muscle development.” GO terms related to transcription, translation, and catabolic processes were enriched in blood USM B cells (Dataset S4 ). One pathway, “EIF2 signaling,” was significantly inhibited in both USM and SM CSF B cell subsets (Dataset S4). We next compared our bulk RNA-Seq data with our scRNA-Seq data. Totals of 1,231 and 1,221 genes were differentially expressed between CSF and blood memory B cells in the 3′ and 5′ single-cell gene expression datasets, respectively (Dataset S3). Of these, 430 were differentially expressed (59 up-regulated in the CSF) across both the 3′ and 5′ datasets. In comparison with their blood counterparts, CSF memory B cells up-regulated CXCR4 and CD70 and down-regulated CD180, CD22, CD27, CD37, CD40, CD52, CD53, CD72, CD74, CD79A, and CD79B. Eight upstream regulators (three activated: PRDM1, NR4A1, and MAPK1; five inhibited: STK11, PAX5, PML, PPARA, and PPARGC1A) were significant (Dataset S4). Seven upstream regulators including TREM1, NF-κB complex, and RELA were all predicted to be activated in the single-cell CSF memory B cell 5′ cohort, similar to the bulk RNA-Seq cohort (Dataset S4). Seventeen causal networks (10 activated, 7 inhibited) were significant in both 3′ and 5′ datasets; 11 causal networks, including CXCL8, were found overlapping with the bulk RNA-Seq cohort (Dataset S4). Additionally, pathway (5′ dataset) and GO (3′ and 5′ datasets) analyses showed that cholesterol biosynthesis was activated in CSF memory B cells (Dataset S4 ). In blood memory B cell counterparts, GO terms associated with antigen processing such as “peptide antigen binding” and “MHC class II protein complex binding” were enriched.

CSF CD27hi Plasmablast/Plasma Cells Display an Inflammatory Phenotype and Down-Regulate Genes in the TGF-β1 Pathway.

A total of 204 genes (63 up-regulated in the CSF) were differentially expressed between CSF and blood plasmablast/plasma cells (Dataset S3). Compared with their peripheral counterparts, CSF plasmablast/plasma cells up-regulated CD138, CD74, CD79A, CD46, and BCR (Table 2 and Dataset S3). Genes related to cytokines/chemokines up-regulated in CSF plasmablast/plasma cells included IL21R, UBA7, TBX21, IL16, SOX5, CXCR4, CXCR3, and CCR5 (Fig. 2). CSF plasmablast/plasma cells enriched for inflammation GO terms including “cytokine receptor activity,” “chemokine receptor activity,” and “G protein-coupled chemoattractant receptor activity” and terms associated with cholesterol biosynthesis (Dataset S4). Similarly, clusterProfiler analysis revealed enrichment of terms associated with fatty acid biosynthesis and inflammation (Dataset S4). “SMAD protein import into the nucleus,” corresponding to regulation of SMAD protein signal transduction, involved in the TGF-β1 signaling pathway (BMPR1A, TOB1, JUN, SPTBN1) was down-regulated in CSF plasmablast/plasma cells (Dataset S4). Finally, CSF plasmablast/plasma cells did not express more IL10RA than their blood counterparts (Dataset S3). Three upstream regulators were significant using IPA (Dataset S4), including inhibition of IL4, a Th2 cytokine (Z = −2). In comparison, 225 and 49 genes were differentially expressed between CSF and blood plasmablast/plasma cells in the 3′ and 5′ scRNA-Seq datasets, respectively (Dataset S3). Of these, only 30 genes (10 up-regulated in the CSF) were shared, with no overlaps in the pathways, predicted master regulators, or causal networks. Additionally, the three significant upstream regulators that were significant in the bulk RNA-Seq analysis were not significant in the single-cell cohort. In the 3′ gene expression dataset, similar to our findings in the bulk study, IL16 was up-regulated in the CSF plasmablast/plasma cells (Dataset S3). Interestingly, TGFB-1 was predicted to be inhibited in the 5' gene expression dataset (Dataset S4); with a downregulation of genes in the TGF-β1 signaling pathway also observed in the CSF plasmablast/plasma cells using the clusterProfiler analysis in the bulk RNA-Seq cohort (Dataset S4). GO analysis using only the genes differentially expressed across both datasets revealed an abundance of terms associated with iron transport protein targeting and translocation (Dataset S4), and while GO terms associated with cholesterol biosynthesis were not enriched, a fourfold enrichment of the enzyme SQLE, necessary for the initiation of cholesterol biosynthesis (P adjusted = 0.001), was observed in the 3′ gene expression dataset. Similar to the bulk RNA-Seq cohort, IL10RA was not differentially expressed between the CSF plasmablast/plasma cells and their blood counterparts (Dataset S3).

Paired scRNA-Seq and scIg-Seq Reveals Clonally Expanded, Somatically Hypermutated IgM and IgG1 CSF B Cells Associated with Acute Inflammation and Intrathecal Ig Synthesis.

We next leveraged our 5′ scRNA-Seq libraries to perform full-length scIg-Seq analysis on paired CSF and blood from RRMS subjects (n = 9) (Fig. 1 and Dataset S1). As expected, we found a statistically greater proportion of IgG1+ B cells between the CSF and blood and a roughly equal proportion of IgG1+ and IgM+ B cells within the CSF (Fig. 3). In our bulk RNA-Seq dataset, we observed similar findings, with IgM-expressing cells found in the naïve and USM subsets and IgG1-expressing cells found predominantly in the plasmablast/plasma cells, SM, and DN subsets (Fig. 3). These findings recapitulate expected changes in the isotypes of CSF B cells (48, 49).
Fig. 3.

Paired scIg-Seq in RRMS ties clonal B cell expansion to intrathecal Ig synthesis and blood–brain barrier breakdown and to the emergence of IgM+ B cells undergoing SHM in the CSF. (A) Isotype usage by scIg-Seq in the 5′ dataset from n = 9 paired CSF and blood RRMS subjects. Statistics were performed with Mann–Whitney U test; mean ± SD is shown. (B) Isotype usage by limited Ig-Seq data extracted from bulk RNA-Seq. (C) Linear correlation between the clinical CSF WBC count vs. total B cells by scRNA-Seq and the clinical IgG index. (D) Linear correlation between the number of expanded B cells detected by scRNA-Seq and the clinical IgG index. (E) Enrichment of clonally expanded CSF, but not blood, B cells in the same subjects with active gadolinium on MRI. Statistics performed with Mann–Whitney U test. (F) Alpha diversity plots from the Immcantation framework depicting restricted diversity in the CSF relative to the blood for both IgM+ and IgG1+ B cells (n = 9 RRMS subjects). There were too few B cells in the HCs to generate comparison plots. Blood is depicted in red, and CSF is depicted in blue. (G) Mutation frequency, basic residue count, and CDR3 length in the CSF and blood of RRMS subjects by scIg-Seq. Statistical test was performed using a Student’s t test. (H) Heat map of the top 20 most differentially expressed genes by the Seurat FindMarkers command in IgM+ B cells with and without replacement mutations. (I) UMAP of IgM+ B cells from scIg-Seq overlaid onto 5′ scRNA-Seq gene expression data from HC (n = 3; 2 had zero B cells with productive VDJ sequences), RRMS (n = 7), and OND (n = 1) subjects. Dot plot adjacent to UMAP with statistical testing comparing the total number of B cells detected in the CSF and blood with either IGM+ SHM or IGM− SHM. Statistical testing was performed with the Mann–Whitney U test. PB/PC, plasmablast/plasma cells; WBC, white blood cell. *P < 0.05, ***P < 0.001, ****P < 0.0001.

Paired scIg-Seq in RRMS ties clonal B cell expansion to intrathecal Ig synthesis and blood–brain barrier breakdown and to the emergence of IgM+ B cells undergoing SHM in the CSF. (A) Isotype usage by scIg-Seq in the 5′ dataset from n = 9 paired CSF and blood RRMS subjects. Statistics were performed with Mann–Whitney U test; mean ± SD is shown. (B) Isotype usage by limited Ig-Seq data extracted from bulk RNA-Seq. (C) Linear correlation between the clinical CSF WBC count vs. total B cells by scRNA-Seq and the clinical IgG index. (D) Linear correlation between the number of expanded B cells detected by scRNA-Seq and the clinical IgG index. (E) Enrichment of clonally expanded CSF, but not blood, B cells in the same subjects with active gadolinium on MRI. Statistics performed with Mann–Whitney U test. (F) Alpha diversity plots from the Immcantation framework depicting restricted diversity in the CSF relative to the blood for both IgM+ and IgG1+ B cells (n = 9 RRMS subjects). There were too few B cells in the HCs to generate comparison plots. Blood is depicted in red, and CSF is depicted in blue. (G) Mutation frequency, basic residue count, and CDR3 length in the CSF and blood of RRMS subjects by scIg-Seq. Statistical test was performed using a Student’s t test. (H) Heat map of the top 20 most differentially expressed genes by the Seurat FindMarkers command in IgM+ B cells with and without replacement mutations. (I) UMAP of IgM+ B cells from scIg-Seq overlaid onto 5′ scRNA-Seq gene expression data from HC (n = 3; 2 had zero B cells with productive VDJ sequences), RRMS (n = 7), and OND (n = 1) subjects. Dot plot adjacent to UMAP with statistical testing comparing the total number of B cells detected in the CSF and blood with either IGM+ SHM or IGM− SHM. Statistical testing was performed with the Mann–Whitney U test. PB/PC, plasmablast/plasma cells; WBC, white blood cell. *P < 0.05, ***P < 0.001, ****P < 0.0001. We next investigated whether these findings correlated with clinical laboratory testing or radiographic characteristics of the patients. We found a significant linear correlation between the clinical IgG index and the total number of CSF B cells we identified in each subject by scRNA-Seq, relative to a weaker correlation between the clinical total white blood cell count and the CSF IgG index (R2 = 0.524, P = 0.027 vs. R2 = 0.278, P = nonsignificant [ns]) (Fig. 3). There was an even stronger correlation between the number of CSF clonally expanded B cells by scRNA-Seq and the CSF IgG index (R2 = 0.561, P = 0.02) (Fig. 3). Further, we found a greater number of clonally expanded CSF B cells in RRMS subjects with gadolinium enhancement on MRI (P = 0.024) (Fig. 3). This increase was unique to the CSF, as there was clonal expansion of CSF B cells relative to the blood in gadolinium-positive subjects (P < 0.01) and no difference in the proportion of clonally expanded B cells between the blood samples of gadolinium-positive and -negative subjects (P = ns) (Fig. 3). We next explored the B cell immune repertoire diversity in the CSF and blood. Interestingly, the B cell clonal expansion in the CSF manifested as restricted diversity in both the IgM+ and IgG1+ cells (Fig. 3). Further comparing all of the scIg-Seq–derived IgM+ and IgG1+ RRMS B cells, we observed an increase in the replacement mutation frequency of the CSF IgM+ CDR sequences vs. their blood counterparts but no difference between blood IgG1+ cells, extending previous bulk Ig-Seq findings (Fig. 3) (P < 0.001 and P = ns, respectively) (50). The H-CDR3 of CSF IgM+ cells contained more basic amino acid residues relative to the blood, and the CDR3 of CSF IgG1+ cells was longer relative to the blood, supportive of antigen experience in these cells (Fig. 3) (P < 0.01). To determine the phenotype of replacement mutation-bearing IgM+ B cells, we combined scRNA-Seq and scIg-Seq to analyze CSF and blood IgM+ B cells, categorizing them as either replacement mutation negative (i.e., germline) or replacement mutation positive (i.e., having undergone somatic hypermutation [SHM]) to identify differentially expressed genes between them. The genes that best distinguished germline and SHM IgM+ B cells are shown (Fig. 3). Germline B cells were TCL1A+, CD27-, TNFRSF13B-, and GPR183-, supporting their identity as naïve B cells. SHM+ IgM+ B cells were TCL1A-, CD27+, GPR183+, and TNFRSF13B+, supporting their identity as memory B cells with a phenotype resembling previously described prediversified splenic marginal zone IgM+ B cells (Fig. 3) (51). Indeed, when we superimposed IgM+ germline and SHM+ B cells onto our 5′ scRNA-Seq gene expression UMAP space, the SHM- cells clustered primarily with naïve B cells, and SHM+ cells clustered primarily with memory B cells (Figs. 2 and 3). Of note, we also identified three IgM+ SHM+ B cells in HC CSF, suggesting that their simple presence in the CNS is not necessarily indicative of a diseased state.

CSF, but Not Blood, B Cell Clonal Expansion Is Linked to the Presence of Unique Oligoclonal Bands in RRMS, and Some CSF B Cells in RRMS Are Highly Similar to B Cells in the Blood of Controls.

To examine whether B cell clonal expansion in RRMS is unique to the CSF, we compared the degree of clonal expansion in the blood and CSF of nine subjects with RRMS, one subject with an atypical neuroinflammatory disorder and uveitis (OND), and three HCs. Of the three HCs, two had no B cells detected in the CSF by scIg-Seq. Substantial B cell clonal expansion was observed in the CSF of patients with RRMS and OND. In contrast, clonal expansion was not detected in the CSF of the HC (Fig. 4). As expected, we observed that CSF B cell clonal expansion was linked to the presence of unique oligoclonal bands in RRMS (Fig. 4). We overlapped the scRNA-Seq and scIg-Seq data from seven of our RRMS subjects and identified clonal expansion primarily in the memory and plasmablast/plasma cell compartments (Fig. 4).
Fig. 4.

(A) Donut charts of paired scIg-Seq results from blood and CSF in RRMS (n = 9), OND (n = 1), and HC (n = 3; 2 subjects had zero CSF B cells by scIg-Seq). Adjacent plots demonstrating clonal expansion in RRMS subjects with or without unique CSF oligoclonal bands (OCBs). Unique indicates a clone seen in only one cell in a subject’s sample. Clonal expansion was further divided into clone counts of two, three, four, and more than four. (B) Dot plot of cell types with clonal expansion in seven RRMS subjects in whom overlapping gene expression data were available. (C) Venn diagrams of shared B cells, determined by highly similar Ig heavy-chain sequences in the scIg-Seq cohort from an expanded set of subjects, including those without paired CSF and blood samples and with sorted B cell subsets as listed in Fig. 1 and Dataset S1 (n = 14 RRMS, n = 3 HC, n = 3 OND, n = 2 CIS). (D) Proportions of public (present in more than one subject) and private (present in only one subject) CSF B cells connected to blood (CSF/blood) and public blood B cells connected to other blood B cells (blood/blood). Of 693 total CSF B cell connections, 5 of 693 were highly similar to blood across subjects (publicly), and 25 of 693 were highly similar to blood within the same subject (privately). Of the 137,416 blood B cell connections, 284 of 137,416 were shared publicly.

(A) Donut charts of paired scIg-Seq results from blood and CSF in RRMS (n = 9), OND (n = 1), and HC (n = 3; 2 subjects had zero CSF B cells by scIg-Seq). Adjacent plots demonstrating clonal expansion in RRMS subjects with or without unique CSF oligoclonal bands (OCBs). Unique indicates a clone seen in only one cell in a subject’s sample. Clonal expansion was further divided into clone counts of two, three, four, and more than four. (B) Dot plot of cell types with clonal expansion in seven RRMS subjects in whom overlapping gene expression data were available. (C) Venn diagrams of shared B cells, determined by highly similar Ig heavy-chain sequences in the scIg-Seq cohort from an expanded set of subjects, including those without paired CSF and blood samples and with sorted B cell subsets as listed in Fig. 1 and Dataset S1 (n = 14 RRMS, n = 3 HC, n = 3 OND, n = 2 CIS). (D) Proportions of public (present in more than one subject) and private (present in only one subject) CSF B cells connected to blood (CSF/blood) and public blood B cells connected to other blood B cells (blood/blood). Of 693 total CSF B cell connections, 5 of 693 were highly similar to blood across subjects (publicly), and 25 of 693 were highly similar to blood within the same subject (privately). Of the 137,416 blood B cell connections, 284 of 137,416 were shared publicly. Nine of 8,692 (0.1%) and 90 of 138,109 (0.06%) B cells identified exclusively across RRMS patients in the bulk and single-cell cohorts were highly similar, respectively (Fig. 4 and ). However, the presence of highly similar B cells across subjects was not specific to a given diagnosis. In the scRNA-Seq cohort, we found that 5 of 693 (0.72%) blood/CSF B cells across subjects, including in HC and OND controls, were highly similar, compared with 284 of 137,416 (0.21%) blood/blood B cells. In the scRNA-Seq cohort, we found a greater number (25 of 693, 3.61%) of B cells that were highly similar across the blood/CSF immune axis within individuals (Fig. 4).

Lack of Epstein–Barr Virus mRNA in MS Blood and CSF B Cell Subsets.

To address the possibility of Epstein–Barr virus (EBV) transcription (or transcription of RNA from any other pathogen) in the B cells from RRMS subjects, we mined our bulk dataset for nonhuman reads. An average of 1,725,779 (interquartile range: 9,657 to 2,326,989) sequences per sample were nonhuman, high quality, and nonredundant (Dataset S1). Of these, all sequences aligned to common environmental and reagent contaminants. No human virus transcripts, including EBV transcripts, were detected using IDseq despite detecting EBV transcripts in positive controls (i.e., RNA from the EBV-infected EHEB cell line) at total input RNA masses ranging from 10 pg to 10 ng. EBV transcripts in the positive controls ranged from 24.4 to 68.7 reads per million. Based on the limit of detection we calculated in the EHEB control, we calculate for each CSF and blood sample the approximate minimum number of EBV-infected cells we would have been able to detect at the sequencing depths we used for each sample (Dataset S1). Additionally, no DGE in HERVs/repetitive elements was observed between individual blood and CSF B cell subsets. However, eight genes, including HERVK9-int (P adjusted = 4.68E-07) (), were up-regulated in blood B cells compared with the CSF. The other seven genes up-regulated in the blood were LINE/L1 elements.

Discussion

While monoclonal antibody therapy against CD20-expressing B cells represents a highly effective treatment approach in MS, the roles B cells play in this complex disease are still not fully understood (52). A better understanding of B cell phenotypes in MS, particularly in the relevant body compartment (i.e., CNS), is vital to develop more effective and better targeted B cell depletion strategies. In this study, we independently characterized the transcriptional phenotypes and immune repertoires of B cell subsets from the CSF and blood of treatment-naïve RRMS subjects at both the bulk and single-cell levels. We have highlighted the potential contribution of distinct subsets of B cells to ongoing inflammation. In addition, we characterized SHM patterns with corresponding transcriptome profiles in CSF B cells in the single-cell cohort and mined the bulk B cell subset transcripts for evidence of EBV and HERV replication. Using scRNA-Seq, we show that in MS patients, the CSF has a remarkably different cellular profile from the blood. We found very few B cells in healthy CSF, barely enough to be detected with current single-cell technology, extending previous reports using flow cytometry (53, 54). In addition, we found an expansion of CD4+ and CD8+ T cells in the CSF. The myeloid population was also distinct, with blood CD14+ and CD16+ monocytes existing in a transcriptional continuum with what we broadly termed CSF macrophages and monocyte-derived dendritic cells. The identification of these CSF macrophages by scRNA-Seq mirrors what was found in recently published scRNA-Seq studies of CSF obtained from patients with HIV-1 infection, RRMS, and anti-myelin oligodendrocyte glycoprotein antibody disorder. These investigators variously called these cells “microglia,” “microglia like,” or “monocytes” (13, 46, 47). Like the study by Esaulova and colleagues (47), we also identified these cells in the CSF of all our HCs, illustrating that their presence is not necessarily indicative of a neuroinflammatory disease process. The CSF also contained a relative dearth of NK cells compared with the blood and a relative enrichment for rare plasmacytoid dendritic cells, a cell type associated with MS relapse and activated via TLR-9 to produce large amounts of type I interferon (IFN) (55). Further subtyping of these cells was beyond the scope of our study. Genetic variation in multiple genes in the NF-κB pathway has been identified by genome-wide association studies as a risk factor for MS (56, 57). We observed NF-κB complex activation in CSF memory (unswitched and switched) B cells. The canonical NF-κB pathway is triggered by signaling through the BCR, toll-like receptors, TREM1, and a variety of TNF receptor superfamily members, through the generation of RELA and NF-κB1 heterodimers (58, 59). TREM1, NF-κB complex, and RELA were predicted to be activated both in the bulk RNA-Seq and scRNA-Seq datasets. Additionally, in CSF SM B cells, CEBPB and ECSIT pathway members directly involved in the activation of NF-κB (58–61) were also predicted to be activated. CEBPB, the CCAAT/Enhancer Binding Protein β transcription factor, is activated by multiple inflammatory stimuli and promotes inflammation in experimental autoimmune encephalomyelitis (EAE) (60). CEBPB−/− mice are resistant to EAE with reduced lymphocyte and antigen presenting cell infiltration into the CNS. In addition, pathway and GO analyses of the bulk and single-cell datasets revealed an enrichment of terms associated with inflammation, activation, and fatty acid biosynthesis in CSF memory and plasmablast/plasma cells. Previous work indicated that inhibition of downstream biosynthesis of cholesterol promotes regulatory Th2 bias and reverses paralysis in EAE models (62, 63). While human trials did not demonstrate a clinical benefit from statins in CIS and RRMS (64–67), there are indications that statins may be beneficial in the secondary progressive form of the disease (68). Further work may identify novel therapeutic targets for MS related to the cholesterol biosynthesis pathway. We also found evidence for a compartmentalized CSF CD27hi plasmablast/plasma cell inflammatory profile in MS. IL16, a chemoattractant whose ligand is CD4, was significantly up-regulated both in the single-cell and bulk analyses, suggesting cross-talk between CSF plasmablast/plasma cells and CD4+ T cells (69). Additionally, in the bulk RNA-Seq analyses, the CSF plasmablast/plasma cells up-regulated IL21 receptor, SOX5, UBA7, and TBX21 and inhibited IL4, suggesting these cells display an inflammatory phenotype. UBA7, an ISG15 activating enzyme, has been previously reported in systemic lupus erythematosus to represent a distinct proinflammatory subset of B cells (70). TBX21, the master transcription factor for Th-1 cell differentiation and IFN-γ gene transcription, has been recently shown to promote transmigration of B cells into the CNS (71–73). Consistent with prior MS studies, genes associated with the SMAD/TGF-β1 signaling pathway were down-regulated in CSF plasmablast/plasma cells using both single-cell and bulk analyses (74). Furthermore, TOB1, a gene involved in the SMAD/TGF-β1 signaling pathway, was down-regulated in CSF plasmablast/plasma cells (75). Low TOB1 expression, previously only reported in T cells from MS subjects and here reported in B cells, has been associated with a higher risk of MS conversion from CIS to RRMS, and its ablation worsens disease severity in EAE models (76, 77). Collectively, these findings support the conclusion that antigen-experienced CSF B cells and plasmablast/plasma cells in MS are polarized toward an inflammatory phenotype compared with their peripheral counterparts. Previous immune repertoire studies in MS suggested that B cell maturation occurs both in the periphery and CNS with Ig class-switched B cells (IgG and IgM) providing an antigen-experienced immune axis connecting the two compartments (41, 78). In this study, we detected similar connections of highly similar B cells across the immune axis that, in a given person, are likely to be clonally related. Data from both bulk and single-cell cohorts support the finding that highly similar B cells are also shared across patients. In addition to observing B cell connections across the CSF/blood and blood/blood of subjects with RRMS, highly similar B cells were also identified between subjects in the OND group and HCs, with no significant difference in the proportion of highly similar B cells shared between MS subjects and individuals in the OND and HC groups. A recent study found that 1 to 6% of identical Ig heavy chain clones (identical IGHV, IGHJ, and H-CDR3) in the blood were shared between three HCs (79). Independent of the presence or absence of genetically similar BCRs across disease subjects and controls, more work needs to be done to characterize the antigenic specificity of expanded B cell populations in the CSF of RRMS patients and to determine whether genetically similar but not identical BCRs are indeed specific for the same antigen. Using bulk and scIg-Seq, we found that most CSF B cells are IgM+ or IgG1+. Using diversity analysis, we showed that both IgM+ and IgG1+ B cells in RRMS subjects are less diverse in the CSF and are clonally expanded. The number of clonally expanded B cells in the CSF of MS subjects correlated with the IgG index, consistent with what was seen with proteomics studies (80). Further, we linked clonal B cell expansion to gadolinium enhancement on MRI, suggesting CSF clonal B cell expansion is heightened during active demyelination and blood–brain barrier breakdown. This extends findings from a recent study that identified a connection between the IgG index and elevated CSF neurofilament light chain, a marker of axonal damage (81). There was increased SHM in CSF IgM+ cells with increased basic residues in the H-CDR3, consistent with neoantigen exposure in the CSF. Further, IgG1+ CSF B cells had longer H-CDR3 sequences relative to their blood counterparts, again suggestive of neoantigen exposure in the CNS. Finally, we showed that while there was a greater number of CSF IgM+ cells that had undergone SHM, these cells were also present in HCs, and so, their presence in low numbers is potentially a part of normal CNS immune surveillance. It has been hypothesized that EBV, the most consistent environmental risk factor for MS, may latently infect MS subjects’ B cells (82, 83). Some groups have identified EBV transcripts by real-time PCR, in situ hybridization, and immunohistochemistry in the CNS (84, 85), whereas others have not been able to detect EBV in these locations (86–88). We did not detect EBV transcripts (or other viral transcripts) in any of the B cell subsets using an unbiased metagenomics platform, even though EBV was detected in the latently EBV-infected control cell line EHEB at 10 pg RNA input, comparable with the amount of RNA in a typical single cell (45, 89, 90). While our findings do not rule out rare latent EBV infection with intermittent transcription, they do argue against constitutive EBV transcription in the B cells of MS subjects. Lastly, a variety of HERV transcripts were present in all of the B cell subsets we studied, but there was no difference in their expression between CSF and blood B cells. The significance of this finding is unclear without a comparable dataset generated from HC and OND cohorts. Our study has several limitations. It is possible that transcriptional profiles are altered by sample handling, although we did use a standardized protocol and processed samples immediately upon acquisition to minimize the potential for batch effect. For the bulk RNA-Seq cohort, we sequenced sorted CSF and blood B cell subsets based on traditional CD19, IgD, and CD27 surface markers. However, this is not the only strategy for defining B cell subsets, so the particular sorting protocol we utilized is important context for appropriately interpreting the transcriptomic profiles that are derived from these sorted cell populations. To account for the bias that arises from defining cell populations based on a priori selection of a limited number of cell surface markers, we also performed scRNA-Seq on unsorted CSF and blood from an independent cohort of MS patients and defined cell identity based on scRNA-Seq profiles alone. While we highlighted pathways that were differentially regulated between CSF and blood in both the bulk and single-cell datasets, the DGE analyses for each of the cohorts were carried out independently, and the data for each set of analyses are available in Dataset S4. Future studies should take advantage of recently developed protocols that allow for the simultaneous single-cell measurements of RNA transcripts and surface proteins using DNA barcoded antibodies (cellular indexing of transcriptomes and epitopes by sequencing or CITE-Seq) (91). Finally, we cannot conclude that the CSF B cell transcriptional signatures we identified are MS specific without additional OND controls and without normative RNA-Seq profiles of HC CSF B cells, which were not feasible to obtain here given the low numbers of CSF B cells in HC CSF, as has been seen in other studies (53, 54). Nevertheless, we have used the combination of bulk RNA-Seq and scRNA-Seq, including with full-length VDJ analysis, to identify a core set of transcriptional programs in antigen-experienced CSF B cells with respect to blood B cells in patients with treatment-naïve RRMS. In this study, we present a comprehensive profile of CNS-specific transcriptional B cell phenotypes in MS at single-cell resolution with paired immune repertoires. Further, we combine this with in-depth profiling of sorted B cell subsets. We reveal a polyclonal IgM and IgG1 CSF B cell expansion polarized toward an inflammatory, memory, and plasmablast/plasma cell phenotype, with differential up-regulation of specific proinflammatory pathways that can serve as potential therapeutic targets. Our data are consistent with a pathogenic role for CNS B cells, including evidence for active cross-talk with T cells in the CNS. We did not find evidence that CNS B cells harbor a neurotropic virus. Taken together, these data support the targeting of activated resident B cells in the CNS as a potentially effective strategy for control of treatment-resistant chronic disease.

Subject Consent.

All studies were approved by the UCSF Institutional Review Board, and written informed consent was obtained from each participant before inclusion in the study.
  91 in total

1.  Predictive modeling of therapy response in multiple sclerosis using gene expression data.

Authors:  Sara Mostafavi; Sergio Baranzini; Jorge Oksernberg; Parvin Mousavi
Journal:  Conf Proc IEEE Eng Med Biol Soc       Date:  2006

2.  Abrogation of T cell quiescence characterizes patients at high risk for multiple sclerosis after the initial neurological event.

Authors:  Jean-Christophe Corvol; Daniel Pelletier; Roland G Henry; Stacy J Caillier; Joanne Wang; Derek Pappas; Simona Casazza; Darin T Okuda; Stephen L Hauser; Jorge R Oksenberg; Sergio E Baranzini
Journal:  Proc Natl Acad Sci U S A       Date:  2008-08-08       Impact factor: 11.205

3.  Longitudinally persistent cerebrospinal fluid B cells can resist treatment in multiple sclerosis.

Authors:  Ariele L Greenfield; Ravi Dandekar; Akshaya Ramesh; Erica L Eggers; Hao Wu; Sarah Laurent; William Harkin; Natalie S Pierson; Martin S Weber; Roland G Henry; Antje Bischof; Bruce Ac Cree; Stephen L Hauser; Michael R Wilson; H-Christian von Büdingen
Journal:  JCI Insight       Date:  2019-03-21

4.  pRESTO: a toolkit for processing high-throughput sequencing raw reads of lymphocyte receptor repertoires.

Authors:  Jason A Vander Heiden; Gur Yaari; Mohamed Uduman; Joel N H Stern; Kevin C O'Connor; David A Hafler; Francois Vigneault; Steven H Kleinstein
Journal:  Bioinformatics       Date:  2014-03-10       Impact factor: 6.937

5.  Lymphocyte phenotype and subset distribution in normal cerebrospinal fluid.

Authors:  A Svenningsson; O Andersen; M Edsbagge; S Stemme
Journal:  J Neuroimmunol       Date:  1995-12       Impact factor: 3.478

6.  Absence of Epstein-Barr virus in the brain and CSF of patients with multiple sclerosis.

Authors:  S A Sargsyan; A J Shearer; A M Ritchie; M P Burgoon; S Anderson; B Hemmer; C Stadelmann; S Gattenlöhner; G P Owens; D Gilden; J L Bennett
Journal:  Neurology       Date:  2010-03-10       Impact factor: 9.910

7.  Increased CSF IgM in multiple sclerosis.

Authors:  A C Williams; E S Mingioli; H F McFarland; W W Tourtellotte; D E McFarlin
Journal:  Neurology       Date:  1978-10       Impact factor: 9.910

8.  Human immunoglobulin (Ig)M+IgD+ peripheral blood B cells expressing the CD27 cell surface antigen carry somatically mutated variable region genes: CD27 as a general marker for somatically mutated (memory) B cells.

Authors:  U Klein; K Rajewsky; R Küppers
Journal:  J Exp Med       Date:  1998-11-02       Impact factor: 14.307

9.  Single-cell RNA-seq analysis of human CSF microglia and myeloid cells in neuroinflammation.

Authors:  Ekaterina Esaulova; Claudia Cantoni; Irina Shchukina; Konstantin Zaitsev; Robert C Bucelli; Gregory F Wu; Maxim N Artyomov; Anne H Cross; Brian T Edelson
Journal:  Neurol Neuroimmunol Neuroinflamm       Date:  2020-05-05

10.  Reference-based analysis of lung single-cell sequencing reveals a transitional profibrotic macrophage.

Authors:  Dvir Aran; Agnieszka P Looney; Leqian Liu; Esther Wu; Valerie Fong; Austin Hsu; Suzanna Chak; Ram P Naikawadi; Paul J Wolters; Adam R Abate; Atul J Butte; Mallar Bhattacharya
Journal:  Nat Immunol       Date:  2019-01-14       Impact factor: 25.606

View more
  35 in total

Review 1.  The Application of Single-Cell RNA Sequencing in Studies of Autoimmune Diseases: a Comprehensive Review.

Authors:  Mingming Zhao; Jiao Jiang; Ming Zhao; Christopher Chang; Haijing Wu; Qianjin Lu
Journal:  Clin Rev Allergy Immunol       Date:  2020-11-25       Impact factor: 8.667

Review 2.  Genetics and functional genomics of multiple sclerosis.

Authors:  Woori Kim; Nikolaos A Patsopoulos
Journal:  Semin Immunopathol       Date:  2022-01-12       Impact factor: 9.623

3.  Deciphering the language of antibodies using self-supervised learning.

Authors:  Jinwoo Leem; Laura S Mitchell; James H R Farmery; Justin Barton; Jacob D Galson
Journal:  Patterns (N Y)       Date:  2022-05-18

4.  Benchmarking computational methods for B-cell receptor reconstruction from single-cell RNA-seq data.

Authors:  Tommaso Andreani; Linda M Slot; Samuel Gabillard; Carsten Strübing; Claus Reimertz; Veeranagouda Yaligara; Aleida M Bakker; Reza Olfati-Saber; René E M Toes; Hans U Scherer; Franck Augé; Deimantė Šimaitė
Journal:  NAR Genom Bioinform       Date:  2022-07-13

Review 5.  Breaching Brain Barriers: B Cell Migration in Multiple Sclerosis.

Authors:  Carla Rodriguez-Mogeda; Sabela Rodríguez-Lorenzo; Jiji Attia; Jack van Horssen; Maarten E Witte; Helga E de Vries
Journal:  Biomolecules       Date:  2022-06-07

6.  Gut microbiota-specific IgA+ B cells traffic to the CNS in active multiple sclerosis.

Authors:  Anne-Katrin Pröbstel; Xiaoyuan Zhou; Ryan Baumann; Sven Wischnewski; Michael Kutza; Olga L Rojas; Katrin Sellrie; Antje Bischof; Kicheol Kim; Akshaya Ramesh; Ravi Dandekar; Ariele L Greenfield; Ryan D Schubert; Jordan E Bisanz; Stephanie Vistnes; Khashayar Khaleghi; James Landefeld; Gina Kirkish; Friederike Liesche-Starnecker; Valeria Ramaglia; Sneha Singh; Edwina B Tran; Patrick Barba; Kelsey Zorn; Johanna Oechtering; Karin Forsberg; Lawrence R Shiow; Roland G Henry; Jennifer Graves; Bruce A C Cree; Stephen L Hauser; Jens Kuhle; Jeffrey M Gelfand; Peter M Andersen; Jürgen Schlegel; Peter J Turnbaugh; Peter H Seeberger; Jennifer L Gommerman; Michael R Wilson; Lucas Schirmer; Sergio E Baranzini
Journal:  Sci Immunol       Date:  2020-11-20

Review 7.  Human Herpesviruses 6A and 6B in Brain Diseases: Association versus Causation.

Authors:  Anthony L Komaroff; Philip E Pellett; Steven Jacobson
Journal:  Clin Microbiol Rev       Date:  2020-11-11       Impact factor: 26.132

8.  Epigenetic Plasticity Enables CNS-Trafficking of EBV-infected B Lymphocytes.

Authors:  Samantha S Soldan; Chenhe Su; R Jason Lamontagne; Nicholas Grams; Fang Lu; Yue Zhang; James D Gesualdi; Drew M Frase; Lois E Tolvinski; Kayla Martin; Troy E Messick; Jonathan T Fingerut; Ekaterina Koltsova; Andrew Kossenkov; Paul M Lieberman
Journal:  PLoS Pathog       Date:  2021-06-09       Impact factor: 6.823

Review 9.  The CD8 T Cell-Epstein-Barr Virus-B Cell Trialogue: A Central Issue in Multiple Sclerosis Pathogenesis.

Authors:  Caterina Veroni; Francesca Aloisi
Journal:  Front Immunol       Date:  2021-07-07       Impact factor: 7.561

Review 10.  B cells in multiple sclerosis - from targeted depletion to immune reconstitution therapies.

Authors:  Maria T Cencioni; Miriam Mattoscio; Roberta Magliozzi; Amit Bar-Or; Paolo A Muraro
Journal:  Nat Rev Neurol       Date:  2021-06-01       Impact factor: 42.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.