| Literature DB >> 27693386 |
Rachael J M Bashford-Rogers1, Anne L Palser2, Clare Hodkinson3, Joanna Baxter3, George A Follows4, George S Vassiliou2, Paul Kellam5.
Abstract
Chronic lymphocytic leukemia (CLL) is characterized by the accumulation of clonally derived mature CD5high B cells; however, the cellular origin of CLL is still unknown. Patients with CLL also harbor variable numbers of CD5low B cells, but the clonal relationship of these cells to the bulk disease is unknown and can have important implications for monitoring, treating, and understanding the biology of CLL. Here, we use B-cell receptors (BCRs) as molecular barcodes to first show by single-cell BCR sequencing that the great majority of CD5low B cells in the blood of CLL patients are clonally related to CD5high CLL B cells. We investigate whether CD5 state switching was likely to occur continuously as a common event or as a rare event in CLL by tracking somatic BCR mutations in bulk CLL B cells and using them to reconstruct the phylogenetic relationships and evolutionary history of the CLL in four patients. Using statistical methods, we show that there is no parsimonious route from a single or low number of CD5low switch events to the CD5high population, but rather, large-scale and/or dynamic switching between these CD5 states is the most likely explanation. The overlapping BCR repertoires between CD5high and CD5low cells from CLL patient peripheral blood reveal that CLL exists in a continuum of CD5 expression. The major proportion of CD5low B cells in patients are leukemic, thus identifying CD5low B cells as an important component of CLL, with implications for CLL pathogenesis, clinical monitoring, and the development of anti-CD5-directed therapies.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27693386 PMCID: PMC5261558 DOI: 10.1016/j.exphem.2016.09.010
Source DB: PubMed Journal: Exp Hematol ISSN: 0301-472X Impact factor: 3.084
Patient sample information
| Patient | White blood cell count (×109 cells/L) | CLL Binet Stage | Treatment status |
|---|---|---|---|
| CLL 1 | 70.3 | A | Untreated |
| CLL 2 | 102.4 | C | Untreated |
| CLL 3 | 95.3 | A | Chlorambucil, rituximab completed 2 years prior to sampling |
| CLL 4 | 69.1 | C | Untreated |
| Healthy individuals 1–4 | Healthy range | NA | NA |
NA = not applicable.
Patient clinical information
| Patient | Genomic abnormality | IgHV mutation status (% identity, bp differences) | Progression of disease |
|---|---|---|---|
| CLL 1 | 13q14.3 deletion | Mutated (96.6%, 7bp) | Progressive, chlorambucil then relapse, bendamustine/rituximab responder. |
| CLL 2 | 13q deletion | Unmutated (99.1%, 2 bp) | Progressive at time of sampling, chlorambucil then relapse, then rituximab/chlorambucil nonresponder. |
| CLL 3 | Bi-allelic 13q deletion | Unmutated (99.5%, 1 bp) | Partial remission at time of sampling after rituximab/chlorambucil, went onto idelalisib/rituximab after sampling, but relapsed. |
| CLL 4 | Not detected | Mutated (94.2%, 11bp) | Progressive at time of sampling, rituximab nonresponder. |
Patients were selected for this study presented with a range of different prognostic factors and follow up information to ensure that the trends observed here could be generalized to multiple CLL subtypes. CLL 1, 2, and 3 exhibited 13q deletions, which are associated with good prognosis [1], and CLL 4 showed no known genomic abnormalities. A further prognostic factor is the IgHV mutation status, where unmutated IgHV (displayed in CLL 1 and 4) has a significantly inferior prognosis to mutated IgHV (displayed in CLL 2 and 3).
Patient clinical information
| Marker type | Genomic/chromosomal markers | Relative prognosis | Reference |
|---|---|---|---|
| Deletions | Deletions in 11q, 17p | Poor | |
| Deletions | Deletions in 13q | Good | |
| Deletions | Deletion in 6q | Intermediate | |
| Mutations | TP53, ATM (tumor suppressor genes) | Poor | |
| Mutations | IRF4, Bcl-2 polymorphism | Good | |
| Mutations | Bcl-6 mutation | Poor | |
| Mutations | MDM2 SNP | Poor | |
| IgVH mutational status | IgVH mutated | Good | |
| IgVH unmutated | Poor | ||
| Gene expression | ZAP-70 (correlates with mutational status) | Poor | |
| V3-21 gene usage | Poor | ||
| MicroRNAs | MicroRNA signature associated with prognosis | - | |
| Telomere length | Longer telomere length (correlates with mutational status) | Good |
Supplementary Figure E1Comparative analysis of CD5high and CD5low B-cell populations in CLL patients and healthy individuals. (A) Peripheral blood mononuclear cell-surface expression of CD5 and CD20. CLL patients showed a higher proportion of CD20+ B cells with high CD5 cell-surface expression compared to healthy individual samples. One representative sample is shown for each. (B) The percentages of CD5high and CD5low B cells (CD20+) in four CLL patients and four healthy individuals, with unpaired t-test p values indicated above sample groups. Error bars indicate mean + SD.
Percentage of CD5high and CD5low B cells of total B cells in CLL and healthy patients’ peripheral blood
| Single-cell experiment | Bulk-cell experiment | Total B cells | |||
|---|---|---|---|---|---|
| CD5low | CD5high | CD5low | CD5high | ||
| CLL 1 | 1.551 | 98.449 | - | - | 6381 |
| CLL 2 | 3.208 | 96.792 | - | - | 6826 |
| CLL 3 | 2.968 | 97.032 | - | - | 6975 |
| Healthy individual 1 | 95.376 | 4.624 | - | - | 757 |
| Healthy individual 2 | 92.580 | 7.420 | - | - | 283 |
| CLL 4 | - | - | 1.751 | 98.249 | 9993 |
| Healthy individual 3 | - | - | 92.837 | 7.163 | 2080 |
| Healthy individual 4 | - | - | 94.315 | 5.685 | 1970 |
Total number of sorted CD19+CD20+CD5+/− B cells.
Figure 1Comparative analysis of CD5high and CD5low B-cell populations in CLL patients and healthy individuals. (A) Bulk-cell total PBMC BCR sequencing networks for CLL patients 1-3 with corresponding maximum cluster sizes and IgHV-J gene usage. Samples yielded 112,722, 222,801, and 151,777 BCR sequences for CLL patients 1, 2, and 3, respectively (after filtering for Ig similarity, length, and primer sequences removal according to Bashford-Rogers et al. [14]). Sequencing networks are presented such that each vertex represents a unique BCR sequence in which relative vertex size is proportional to the number of identical sequence reads. Edges were generated between vertices that differed by single-nucleotide, non-indel differences and clusters were collections of related, connected vertices. The largest cluster sizes (CLL clusters) and corresponding IgHV-J combinations are indicated above and below the networks, respectively. (B) Single-cell analysis of CD5high or CD5low B-cell populations: single-cell BCR sequencing of CD5high and CD5low B cells from CLL patients 1, 2, and 3 was used to determine the frequencies of CLL cells in each B-cell subset. The number and percentage of CD5high or CD5low B cells expressing BCRs matching (i.e., maximum of 3 bp difference from the dominant CLL BCR sequence) the CLL clone for each CLL patient are indicated.
Supplementary Figure E2Optimizing gates for single-cell FACS sorting using beads. Polystyrene microparticle beads were mixed with the fluorochrome-conjugated human antibodies to optimize fluorescence gating settings for flow cytometric single-cell sorting of CD5high and CD5low B cells. After gating of monomeric microparticle beads, indicated by the red gates in part (i) in each panel, the (A) CD5high and CD5low gates (red and green, respectively) were set around the positive and negative control CD5 microparticle beads, (B) CD19 positive gates (blue), and (C) CD20 positive gates (blue).
Supplementary Figure E3Single-cell flow sorting of CD5high and CD5low B cells. The gating strategy used for identification of CD5high and CD5low mature peripheral blood B cells in the single-cell experiments for (A) CLL patient 1, (B) CLL patient 2, (C) CLL patient 3, (D) healthy individual 1, and (E) healthy individual 2. After exclusion of debris using light scatter features of leukocytes (panels (i)) and exclusion of dead cells (panels (ii)), mature B cells were identified as CD19+CD20+ (panels (iii)). Events captured in the red gate in panels (iv) were identified as CD5high, and those captured in green gate were identified as CD5low. Gates were chosen based on bead experiments (Supplementary Fig. E1).
Percentage of reads from the maximum BCR clone in the CD5high and CD5low B-cell populations in CLL and healthy individuals
| Sequencing method | % CLL sequences | Total number of sequences | |||
|---|---|---|---|---|---|
| CD5low | CD5high | CD5low | CD5high | ||
| CLL 1 | Single-cell | 76.56 | ND | 64 | ND |
| CLL 2 | Single-cell | 96.34 | 100.00 | 82 | 80 |
| CLL 3 | Single-cell | 98.15 | 97.62 | 54 | 42 |
| CLL 4 | Bulk-cell | 68.80 | 99.85 | 109775 | 397469 |
| Healthy individual 3 | Bulk-cell | NA | NA | 206201 | 149673 |
| Healthy individual 4 | Bulk-cell | NA | NA | 194555 | 11375 |
ND = not determined; NA = not applicable.
Sequences within 5bp of sequences in the largest cluster.
Single-cell sequencing performed by Sanger sequencing and bulk-cell sequencing performed by MiSeq.
Supplementary Figure E4Bulk-cell flow sorting of CD5high and CD5low B cells. The gating strategy used for identification of CD5high and CD5low mature peripheral blood B cells in the bulk-cell experiments for (A) CLL patient 4, (B) healthy individual 3, and (C) healthy individual 4. After exclusion of debris using light scatter features of leukocytes (panels (i)) and exclusion of dead cells (panels (ii)), mature B cells were identified as CD20+ (panels (iii)). Events captured in the dark-blue gate in panels (iii) were identified as CD5high, and those captured in light-blue gate were identified as CD5low.
Figure 2Comparison of CD5high and CD5low B-cell populations. Shown are plots of the frequencies of individual BCRs between CD5low and CD5high B-cell populations for healthy patient 1 (A), healthy patient 2 (B), and CLL patient 4 (C) as determined by sequencing the BCR repertoires using MiSeq. For CLL patient 4, in whom there is an overlap between CD5low and CD5high samples, the least-squares regression line equation and R2 value was determined for all BCRs, as indicated. (D) Theoretical model of CD5-state switching: unidirectional switch from CD5high to CD5low (scenario A), unidirectional switch from CD5high to CD5low (scenario B), and bidirectional dynamics switching/alternating between states (scenario C). These processes can be either a rare event (occurring only from a single or low number of CLL B cells) or a common/continuous event (occurring from a large percentage of the CLL B-cell population). (E) Determining the dynamics of CD5 state-switching in CLL patient 4: BCR sequence networks for total PBMCs (Ei) and BCR sequence networks for separated CD5high B cells (Eii, left) and CD5low B cells (Eii, right). The dominant cluster sizes are indicated above each plot. (Eiii) Combined maximum parsimony phylogenetic tree of the CLL cluster generated using the combined CLL BCR sequences from the separated CD5high and CD5low B-cell populations from (Eii). The branch lengths are proportional to the number of base differences from the central BCR sequence (evolutionary distance) and the resulting tree tips are colored red if the BCR was observed in CD5low B cells only, blue if observed in CD5high B cells only, and green if observed in both CD5low and CD5high B cells. Bootstrapping was performed to evaluate the reproducibility of the trees, showing strong tree support (>90% certainty for all branches). The overlap between CD5low and CD5high CLL BCR populations was significantly greater than that expected by a rare CD5 state-switching event originating from the central BCR (p < 10−10).
Maximum cluster sizes as a percentage of total BCR reads for CLL 4, healthy individual 3, and healthy individual 4
| Maximum cluster size (% of total network) | ||
|---|---|---|
| CD5high B cells | CD5low B cells | |
| CLL 4 | 99.85 | 68.8 |
| Healthy individual 3 | 4.62 | 2.19 |
| Healthy individual 4 | 21.17 | 3.1 |
Supplementary Figure E5Analysis A.of the nonleukemic counterpart of CD5+/– B-cell populations in CLL patients. (A) Comparisons of the largest cluster sizes (as a percentage of total reads) and (B) hierarchical clustering of IgHV-J gene usage similarities between CD5high (red) and CD5low (green) B-cell populations for two healthy individuals and CLL patient 4 (either for all reads in the sample, or all reads that were unrelated to the CLL clone, defined as >30 bp differences and/or >10 gaps within an alignment from the CLL clonal sequence). The CD5high and CD5low samples from patient 4 have increased clonality compared to healthy individuals (Supplementary Fig. E5A). However when the sequences related to the CLL clone are removed from the CD5high and CD5low samples (i.e., the nonleukemic counterpart of the repertoire), there is no significant difference in clonality of V-J gene usage frequencies. Both CD5high and CD5low samples from CLL patient 4, when excluding the CLL clone, cluster together with the healthy individuals.
Probabilities of BCR repertoire overlap between CD5high and CD5low populations in patient CLL 4 occurring by chance given a single switch event
| Distance from central BCR | Number of CD5low BCRs | BCRs observed in both CD5low and CD5high populations | Number of CD5high BCRs | |
|---|---|---|---|---|
| 1 | 613 | 581 | 584 | <10 × 10–10 |
| 2 | 3749 | 522 | 714 | <10 × 10–10 |
| 3 | 486 | 47 | 69 | <10 × 10–10 |
| 4 | 28 | 1 | 3 | 1.05 × 10–8 |
| 5 | 1 | 1 | 2 | <10 × 10–10 |
Supplementary Figure E6Testing the sub-clustering of CD5high and CD5low BCR sequences in the CLL BCR sequences in CLL patient 4. (A) Histogram of the nucleotide distances between any two sequences within the CD5low (red) or CD5high (green) or between the sequences of the CD5high and CD5low subsets (blue). The three subsets showed indistinguishable distributions of nucleotide distance distributions suggesting that the CD5high and CD5low CLL B cells are co-clustered. (B) To determine whether the CD5low and CD5high sequences subcluster into distinct groups on the phylogenetic tree (where the null hypothesis is that they are randomly distributed on the phylogenetic tree), the distances were randomly sampled at the same proportions, and the histogram of distance ratios of within groups versus between groups was calculated for each iteration. The p value for which the observed data distance ratio was nonrandom is not significant (bootstrapped p = 0.509), suggesting that the null hypothesis should be accepted and the sequences that generate the pairwise distances are randomly distributed on the tree (i.e., there is no parsimonious route to a single or low number of switch events).
Supplementary Figure E7Testing the cell surface expression differences between CD5high and CD5low from 4 CLL patients. (A) The gating strategy used for identification of CD5high and CD5low peripheral blood B cells from PBMC samples, where CD5high cells were defined as greater than 1.05 × (median CD5 fluorescence in the CD19+CD45+ lymphocytes, green gate) and CD5low were defined as less than 0.95 × (median CD5 fluorescence in the CD19+CD45+ lymphocytes, blue gate). (B) The corresponding boxplots of the mean of fluorescence (MFI) between the CD5high and CD5low B-cell populations for (i) CD81, (ii) CD22, (iii) CD38, (iv) CD20, (v) CD19, and (vi) CD45. p values were calculated as paired two-tailed t-tests. (C) Violin plots of the numbers of mutations in the CD5high and CD5low BCR sequences from the sequencing data in Figure 2. **** p value < 10−6. Flow-cytometric fluorescence intensities of fluorophore-conjugated antibodies against a set of CLL-associated B-cell surface antigen (namely CD81, CD22, CD38, CD20, CD19, and CD45) were used to quantify cell-surface expression in the CLL clone. Notably, the mean of fluorescence (MFI) for the expression of both CD45 and CD81 by circulating CLL B cells was significantly lower for CD5low B cells compared to CD5high B cells (paired t-test p values of 0.011 and 0.00023, respectively; Supplementary Fig. E7). CD45 is normally found on all leukocytes, where previous studies show CD45 plays a positive regulatory role during B-cell receptor signaling [13]. CD81 is usually expressed at high levels in normal germinal center B cells, and associated with mutated CLL. This is in agreement with our data, as the CD5high CLL B cells are associated with a higher number of mutations within the CLL clone as well as higher CD81 levels.