| Literature DB >> 35615019 |
Ivan Belyaev1,2, Alessandra Marolda3, Jan-Philipp Praetorius1,2, Arjun Sarkar1,2, Anna Medyukhina1,4, Kerstin Hünniger3,5, Oliver Kurzai3,5,6, Marc Thilo Figge1,7.
Abstract
Rapid identification of pathogens is required for early diagnosis and treatment of life-threatening bloodstream infections in humans. This requirement is driving the current developments of molecular diagnostic tools identifying pathogens from human whole blood after successful isolation and cultivation. An alternative approach is to determine pathogen-specific signatures from human host immune cells that have been exposed to pathogens. We hypothesise that activated immune cells, such as neutrophils, may exhibit a characteristic behaviour - for instance in terms of their speed, dynamic cell morphology - that allows (i) identifying the type of pathogen indirectly and (ii) providing information on therapeutic efficacy. In this feasibility study, we propose a method for the quantitative assessment of static and morphodynamic features of neutrophils based on label-free time-lapse imaging data. We investigate neutrophil activation phenotypes after confrontation with fungal pathogens and isolation from a human whole-blood assay. In particular, we applied a machine learning supported approach to time-lapse microscopy data from different infection scenarios and were able to distinguish between Candida albicans and C. glabrata infection scenarios with test accuracies well above 75%, and to identify pathogen-free samples with accuracy reaching 100%. These results significantly exceed the test accuracies achieved using state-of-the-art deep neural networks to classify neutrophils by their morphodynamics.Entities:
Keywords: Bloodstream infection; Candida infection; Diagnostic markers; Image analysis; Machine learning; Whole blood infection model
Year: 2022 PMID: 35615019 PMCID: PMC9120255 DOI: 10.1016/j.csbj.2022.05.007
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 6.155
Fig. 1Time-dependent change of a single neutrophil during 20 consecutive frames (arrows indicate the time ordering). Cells in sub-images B3–E3 can be considered as spreading cells (S-morphology).
Fig. 2Fraction of cells repeatedly identified as exhibiting S-morphology in each repetition of the Monte-Carlo simulations. Each sample includes O(104) segmented cell images.
Fig. 3a) Box diagrams for the fraction of spreading cells per video frame (260 frames in total) for each donor. b) Median value of the distributions in a) per donor. ** p = 0.0027, *** p ≪ 10−4 (Quade test with post-hoc analysis and p adjustment by Holm). The effect size statistics is listed in the Table 1. c) Confusion matrix for the results of the Bayesian classifications based on individual frames. Each cell of the matrix represents the ratio of proper sample classifications (numerator) for given infection scenarios over all iterations (denominator). d) Confusion matrix for the results of a sample classification based on description of whole video data.
Comparison of effect sizes expressed via common language effect size (CLES) and Hedge's for median fraction of spreading cells (CLESfrac, ) and for average speed per sample (CLESspeed, ). Details about calculations are described in paragraph Effect size statistics in Materials and Methods section.
| Pair for comparison | CLESfrac | CLESspeed | ||
|---|---|---|---|---|
| ‘mock’–‘ | 1 | 0.91 | 3.18 | 1.58 |
| ‘mock’–‘ | 1 | 0.94 | 5.47 | 2.13 |
| ‘ | 0.90 | 0.63 | 1.67 | 0.35 |
Fig. 4Diagrams of the average speed per cell (a) and per donor (b). The number of data points per sample is O(104), length of whiskers is not larger than 1.5 interquartile interval. For data in (b) the Quade statistical test was applied with post-hoc analysis and p adjustment by Holm: * p = 0.1265, ** p = 0.0224, *** p = 0.0011. The effect size statistics is listed in the Table 1.
Fig. 5Distributions of instantaneous speed for spreading and non-spreading neutrophils for a joint sample sets after confrontation with (a) C. albicans and (b) C. glabrata. Each joint sample set (represented by an individual curve) includes 9 × 103 data points composed of data from randomly selected 1 × 103 spreading cells (dashed lines) or an equal amount of non-spreading cells (solid lines) from each video. c) Shift functions presented by the difference between deciles of distributions in (a) and (b), respectively. The whiskers indicating the 0.95 bootstrap CI (for details see subsection Comparison of cell characteristics for different infection scenarios in the Materials and Methods section). d) Scatter diagram demonstrating the correlation between the median fraction of spreading cells per frame and median average speed per cell for the same sample.
Fig. 6Comparison of morphodynamics descriptors for joint populations of C. albicans- or C. glabrata-infected neutrophils by a box plot with whiskers indicating the whole range of values as well as decile-difference diagrams with whiskers indicating 0.95 bootstrap CI (see subsection Comparison of cell characteristics for different infection scenarios in the Materials and Methods section). All diagrams were built using balanced sampling (for details see subsection Data set organisation and sampling procedures in the Materials and Methods section). a) Distributions of the normalised number of transitions between non-spreading and spreading state. b) Distributions of the total amount of time that cells remain in state with S-morphology. c) Distributions of durations of the longest spreading episode per cell track.
Fig. 7a) Confusion matrix for sample classification results based on the fraction of neutrophil tracks with pathogen-specific morphodynamics. b) The typically detected fraction of cells with pathogen-specific morphodynamics in a given sample. The mean value is computed over all iterations and the whiskers indicate 0.95 CI for the detected fraction (see Confidence intervals for proportions in the Materials and Methods section). The number indicates a probability of the error type II for fraction of neutrophils with C. albicans- or C. glabrata-specific morphodynamics in a given sample. The symbol NA was used where the computation of this probability is not possible. For further details see subsection Post-hoc analysis of errors of type II in the Materials and Methods section.