| Literature DB >> 31081599 |
Mariam Nassar1, Minh Doan2, Andrew Filby3, Olaf Wolkenhauer1,4, Darin K Fogg5, Justyna Piasecka6, Catherine A Thornton6, Anne E Carpenter2, Huw D Summers6, Paul Rees6, Holger Hennig1,2.
Abstract
White blood cell (WBC) differential counting is an established clinical routine to assess patient immune system status. Fluorescent markers and a flow cytometer are required for the current state-of-the-art method for determining WBC differential counts. However, this process requires several sample preparation steps and may adversely disturb the cells. We present a novel label-free approach using an imaging flow cytometer and machine learning algorithms, where live, unstained WBCs were classified. It achieved an average F1-score of 97% and two subtypes of WBCs, B and T lymphocytes, were distinguished from each other with an average F1-score of 78%, a task previously considered impossible for unlabeled samples. We provide an open-source workflow to carry out the procedure. We validated the WBC analysis with unstained samples from 85 donors. The presented method enables robust and highly accurate identification of WBCs, minimizing the disturbance to the cells and leaving marker channels free to answer other biological questions. It also opens the door to employing machine learning for liquid biopsy, here, using the rich information in cell morphology for a wide range of diagnostics of primary blood.Entities:
Keywords: high-content analysis; imaging flow cytometry; label-free classification; liquid biopsy; lymphocytes; machine learning; personalized medicine; white blood cell count; white blood cells
Year: 2019 PMID: 31081599 PMCID: PMC6767740 DOI: 10.1002/cyto.a.23794
Source DB: PubMed Journal: Cytometry A ISSN: 1552-4922 Impact factor: 4.355
Figure 1Workflow for selecting cell images for machine learning. Images captured by imaging flow cytometry were curated by human experts to remove out‐of‐focus events and artifacts (a), (b). In‐focus events (c) were then analyzed for exclusion of debris (d), and doublets or coincident events (e). Images of single cells in sharp focus (f) were then saved as .CIF files for later inputs of CellProfiler. Abbreviations: BF = Brightfield; DF = Darkfield. The fluorescence channel shown is CD15 in this example. Similar gating was used for other surface marker stains. Scale bar is 10 μm).
Figure 2The steps of the developed workflow combining imaging flow cytometry and machine learning to classify WBCs.
Figure 3T‐SNE visualization of the training data set.
Highest ranked morphological features for WBC classification. The table shows the most important features used by gradient boosting for the WBC main types classification using random undersampling. Detailed explanation of the features can be found in the CellProfiler user manual available at http://cellprofiler.org/manuals/
| Feature | Channel |
|---|---|
| MAD intensity | Darkfield |
| Std intensity | Darkfield |
| Integrated intensity | Darkfield |
| Lower quartile intensity | Brightfield |
| Granularity 1 | Darkfield |
| Mean Intensity | Brightfield |
| Upper quartile intensity | Darkfield |
| Granularity 1 | Brightfield |
| Std intensity edge | Brightfield |
| Integrated intensity edge | Darkfield |
The first column contains the feature names and the second column contains the associated channel. Features were measured in the entire cell (no subcompartments of cells were defined). The features were sorted by their frequencies in 10 random undersampling test runs.
Highest ranked morphological features for lymphocyte classification. The table shows the most important features used by gradient boosting for lymphocyte classification using random undersampling
| Feature | Channel |
|---|---|
| Std intensity edge | Brightfield |
| Lower quartile intensity | Brightfield |
| MeanFrac Radial Distribution 4of4 | Brightfield |
| Mean intensity | Brightfield |
| Integrated intensity edge | Darkfield |
| Granularity 1 | Brightfield |
| FracAtD Radial Distribution 4of4 | Brightfield |
| Granularity 1 | Darkfield |
| DifferenceVariance Texture 3_0 | Brightfield |
| Granularity 3 | Brightfield |
The first column contains the feature names and the second column contains the associated channel. Features were measured in the entire cell (no subcompartments of cells were defined). The features were sorted by their frequencies in 10 random undersampling test runs.
Figure 4Average WBC count over 85 unstained blood donors compared with the average WBC count range.