| Literature DB >> 35592315 |
Yen Hoang1, Stefanie Gryzik1, Ines Hoppe1, Alexander Rybak1,2, Martin Schädlich1,2, Isabelle Kadner1,2, Dirk Walther3, Julio Vera4, Andreas Radbruch1,5, Detlef Groth2, Sabine Baumgart1,6, Ria Baumgrass1,2.
Abstract
Recently, mass cytometry has enabled quantification of up to 50 parameters for millions of cells per sample. It remains a challenge to analyze such high-dimensional data to exploit the richness of the inherent information, even though many valuable new analysis tools have already been developed. We propose a novel algorithm "pattern recognition of immune cells (PRI)" to tackle these high-dimensional protein combinations in the data. PRI is a tool for the analysis and visualization of cytometry data based on a three or more-parametric binning approach, feature engineering of bin properties of multivariate cell data, and a pseudo-multiparametric visualization. Using a publicly available mass cytometry dataset, we proved that reproducible feature engineering and intuitive understanding of the generated bin plots are helpful hallmarks for re-analysis with PRI. In the CD4+T cell population analyzed, PRI revealed two bin-plot patterns (CD90/CD44/CD86 and CD90/CD44/CD27) and 20 bin plot features for threshold-independent classification of mice concerning ineffective and effective tumor treatment. In addition, PRI mapped cell subsets regarding co-expression of the proliferation marker Ki67 with two major transcription factors and further delineated a specific Th1 cell subset. All these results demonstrate the added insights that can be obtained using the non-cluster-based tool PRI for re-analyses of high-dimensional cytometric data.Entities:
Keywords: combinatorial protein expression; high-dimensional cytometry data; mass cytometry data; multi-parametric analysis; pattern perception; re-analysis
Mesh:
Substances:
Year: 2022 PMID: 35592315 PMCID: PMC9110672 DOI: 10.3389/fimmu.2022.849329
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 8.786
Figure 1Schematic comparison of the analysis approach with the re-analysis approach. Comparison of the analysis approach of Spitzer et al. (14) and the re-analysis of the data with PRI. Both studies analysed the same publicly available datasets on tumor treatments, while our re-analysis was limited to blood samples from day three from mice. For this dataset, there were a total of four different treatments, which can be divided into the two groups effective and ineffective. Within the ineffective treatment group there were animals that received no treatment (untreated - untr.) and those that received an ineffective (ineff.; anti-PD-1) treatment. In addition, there were effective treatment1 (effective 1; IFN-γ + anti-CD40 + CD1-allo-IgG) and effective treatment2 (effective 2; IFN-γ + anti-CD40 + B6-allo-IgG). For details on the experimental setting see Spitzer et al. The original analysis used common tools in flow cytometry, such as scaffold maps and citrus as well as clustering to reduce dimensions. As a result, the authors found changes in the frequencies of clustered cell subsets. In our bin-based re-analysis (PRI approach) we found different patterns and properties of grouped cell subsets concerning frequencies and MSI (mean signal intensity).
Figure 2Flowchart of the used PRI-analysis workflow. (A) The flowchart lists the step by step details. (B) Gating strategy focusing on selecting CD4 T helper cells – plots from left to the right show the identification of live single cells by using the signal of the iridium-DNA-intercalator and the negative expression of cisplatin represented by 195Pt. Left-over internal standard beads were excluded by using high 140Ce signal. CD45 as unique pan-leukocytic marker, and CD3 were used to detect T cells. The very right plot shows the discrimination of CD4 T helper cells from CD8 cytotoxic T cells.
Figure 3Identification of intensity bin-patterns and bin-values for the classification of mice receiving treatments with differential effects. (A) Comparison of the density and mean signal intensity (MSI)-bin patterns of two exemplary samples (untreated blood1 and effective 2 blood2) using semi-continuous binning with CD90 (x-axis) and CD44 (y-axis) and dynamic intensity ranges for z. (B) Statistical comparison of three different metrics as z parameter in PRI per sample: the frequency of the quadrant cells (left graph; black quadrant numbers of density plots), the maximal bin-MSI values of CD86 and CD27 (middle graph; max-bin values of the color-coded legend of MSI bin plots), and the frequencies of z+ cells per quadrants (right graph; red quadrant numbers of MSI-bin plots of all 11 samples). The ineffective treatment group with untreated (untr.) and ineffectively treated (ineff.) samples is shown in gray and the effective treatment group with effective treatment 1 and 2 in black. P < 0.05 was considered significant, with the numbers of asterisks indicating: *p ≤ 0.05; **p ≤ 0.01; ns stands for non-significant.
Figure 4Cell subset characterization using side by side plots and common ranges for each z-parameter. (A) Comparison of mean signal intensity (MSI)-bin patterns of two exemplary samples (untreated blood1 and effective 2 blood2) for seven z parameters using common intensity ranges for z (min-max of both samples). (B) Statistical comparison of the maximum bin-MSI values per sample (PRI-analysis) and the frequencies of the quadrant cells in Q3 and Q4 (red quadrant numbers of the MSI-bin plots) of all 11 samples for seven z parameters. The ineffective treatment group with untreated (untr.) and ineffectively treated (ineff.) samples is shown in gray and the effective treatment group with effective treatment 1 and 2 in black. P < 0.05 was considered significant, with the numbers of asterisks indicating: *p ≤ 0.05; **p ≤ 0.01.
Figure 5Analysis of the co-expression of the proliferation marker Ki67 and the master transcription factors Tbet and Foxp3. (A) Analysis of bin patterns for mean signal intensity of z+ cells (MSI+) using one exemplary sample for effective therapy (effective 2 blood2) for Ki67, Tbet and Foxp3 using dynamic ranges for each sample. (B) Pie charts show differences in Ki67 co-expression between the two exemplary samples (untreated blood1 and effective 2 blood2).
I Advantages of PRI.
| Characteristics | power of PRI | Comments |
|---|---|---|
| unsupervised | ++ | |
| reproducible results | ++ | |
| quantitative analysis (single cell resolution) | ++ | PRI: retained for analysis |
| intuitive interpretation | ++ | |
| simple statistical analysis (robust) | ++ | |
| sample comparison | ++ | |
| integration of experience and expertise | ++ | |
| no clustering-based method | ++ | |
| no-downsampling | ++ | downsampling is a disadvantage of many other methods |
| automatable | + | |
| low computing power | + | |
| rapid visualisation | + | |
| stand-alone application | + | PRI: after pre-processing of cytometry data (e.g. FlowJo) |
+ high power of PRI; ++ very high power of PRI.