| Literature DB >> 35474892 |
Corin F Otesteanu1, Martina Ugrinic2, Gregor Holzner2, Yun-Tsan Chang3,4, Christina Fassnacht3,4, Emmanuella Guenova3,4, Stavros Stavrakis2, Andrew deMello2, Manfred Claassen5.
Abstract
The application of machine learning approaches to imaging flow cytometry (IFC) data has the potential to transform the diagnosis of hematological diseases. However, the need for manually labeled single-cell images for machine learning model training has severely limited its clinical application. To address this, we present iCellCnn, a weakly supervised deep learning approach for label-free IFC-based blood diagnostics. We demonstrate the capability of iCellCnn to achieve diagnosis of Sézary syndrome (SS) from patient samples on the basis of bright-field IFC images of T cells obtained after fluorescence-activated cell sorting of human peripheral blood mononuclear cell specimens. With a sample size of four healthy donors and five SS patients, iCellCnn achieved a 100% classification accuracy. As iCellCnn is not restricted to the diagnosis of SS, we expect such weakly supervised approaches to tap the diagnostic potential of IFC by providing automatic data-driven diagnosis of diseases with so-far unknown morphological manifestations.Entities:
Keywords: Sézary syndrome; cancer cell imaging; deep learning; high-throughput imaging; image flow cytometry; machine learning; peripheral blood mononuclear samples; weakly supervised learning
Year: 2021 PMID: 35474892 PMCID: PMC9017143 DOI: 10.1016/j.crmeth.2021.100094
Source DB: PubMed Journal: Cell Rep Methods ISSN: 2667-2375
Figure 1Workflow for label-free IFC-based clinical diagnostics
(A) Peripheral blood mononuclear cell samples are imaged using our IFC platform. The digitized cell images are preprocessed and then used to train a machine learning model, with labels either at a whole-sample level or (weak) labels at an individual image level. Multiple images are pooled together in a bag of cells, and used to train a classifier to provide a final diagnosis probability.
(B) Number of cell images acquired per patient sample.
(C and D) (C) Signal-to-noise ratio and contrast-to-noise ratio of the IFC images. (D) Example images of healthy donor cells and Sézary syndrome patient cells captured using both imaging flow cytometry and high-resolution electron microscopy. Scale bars, 5 μm.
Figure 2iCellCnn model architecture
The digitized cell images are preprocessed and used to train a machine learning model, with labels either at an individual image level or (weak) labels at the whole-sample level. A convolutional autoencoder is trained as a feature extractor to represent each single image in a latent space. Multiple images are pooled together in a bag of cells, which is represented in a latent space (bag of cell features), and used to train a random forest classifier to provide the final diagnosis probability.
Figure 3Predicted percentage of cells with atypical (Sézary) morphology in the blood of healthy and diseased specimens
This was achieved using (A) strong supervision using naive labels, (B) strong supervision with a subset of annotations on diseased cells, (C) bag of cells approach using weak supervision, and (D) group-wise amounts of healthy donor and Sézary syndrome patient predicted cells with atypical (Sézary) morphology.
(E) Patient-level classification accuracy, p values, and Hellinger distance between healthy and diseased probability distributions. The error-bars represent the test accuracy standard deviation across five training experiments.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| anti-human CD3 monoclonal antibody conjugated with PerCP | Miltenyi Biotec, Gladbach, Germany | clone BW264/56, Cat#130-096-910; RRID: |
| Sezary Syndrome patient PBMCs | University of Zürich Biobank | Biobank project (EK No. 647) |
| Healthy donor PBMCs | Blutspende SRK Zürich | N/A |
| SYTOX™ Red Dead Cell Stain | ThermoFisher Scientific | Cat#S34859 |
| Formaldehyde solution | Sigma-Aldrich, Buchs, Switzerland | Cat# 47608, CAS: 50-00-0 |
| Poly-L-lysin | Sigma-Aldrich, Buchs, Switzerland | Cat# P6282, CAS: 25988-63-0 |
| Glutaraldehyde solution | Sigma-Aldrich, Buchs, Switzerland | Cat# G7776, CAS:111-30-8 |
| Osmium tetroxide | Sigma-Aldrich, Buchs, Switzerland | Cat# 201030, CAS:20816-12-0 |
| Thiocarbohydrazide | Sigma-Aldrich, Buchs, Switzerland | Cat# 223220, CAS:2231-57-4 |
| Epoxy embedding medium | Sigma-Aldrich, Buchs, Switzerland | Cat# 45345 |
| Uranyl Acetate 98%, ACS Reagent | Polysciences | Ref: 45345 |
| IFC image data | This paper | |
| python 2.7 | Python Software Foundation | |
| tensorflow 1.7 | ||
| keras 2.1.5 | ||
| scikit-learn 0.19 | ||