| Literature DB >> 32495539 |
Wanyue Zhao1, Yingxue Guo1, Sigang Yang1, Minghua Chen1, Hongwei Chen1.
Abstract
SIGNIFICANCE: The use of optofluidic time-stretch flow cytometry enables extreme-throughput cell imaging but suffers from the difficulties of capturing and processing a large amount of data. As significant amounts of continuous image data are generated, the images require identification with high speed. AIM: We present an intelligent cell phenotyping framework for high-throughput optofluidic time-stretch microscopy based on the XGBoost algorithm, which is able to classify obtained cell images rapidly and accurately. The applied image recognition consists of density-based spatial clustering of applications with noise outlier detection, histograms of oriented gradients combining gray histogram fused feature, and XGBoost classification. APPROACH: We tested the ability of this framework against other previously proposed or commonly used algorithms to phenotype two groups of cell images. We quantified their performances with measures of classification ability and computational complexity based on AUC and test runtime. The tested cell image datasets were acquired from high-throughput imaging of over 20,000 drug-treated and untreated cells with an optofluidic time-stretch microscope.Entities:
Keywords: automatic cell detection; imaging cytometry; machine learning; time-stretch microscopy
Year: 2020 PMID: 32495539 PMCID: PMC7267411 DOI: 10.1117/1.JBO.25.6.066001
Source DB: PubMed Journal: J Biomed Opt ISSN: 1083-3668 Impact factor: 3.170
Fig. 1(a) Optofluidic time-stretch imaging cytometer system setup. EDFA, erbium-doped fiber amplifier. (b) Images of drug-treated (group 2) and untreated (group 1) MCF-7 cells under our optofluidic time-stretch microscope (flowing at a speed of ). Outlier samples are the images of noise samples such as bubbles and broken cells.
Fig. 2Flowchart of cell phenotyping steps.
Fig. 3Outlier sample detection principle diagram based on DBSCAN.
Comparison of outlier detection methods.
| DBSCAN | DENCLUE | LOF | None | |
|---|---|---|---|---|
| Accuracy | 0.9716 | 0.9522 | 0.9691 | 0.9574 |
| Runtime (s) | 88.72 | 494.30 | 544.49 | — |
Runtime comparison of feature extraction methods.
| High-dimensional feature | Low-dimensional feature | |||||
|---|---|---|---|---|---|---|
| PCA | LBP | Gabor | HOG | Gray histogram | Size | |
| Feature extraction time (s) | 912.16 | 49.96 | 964.64 | 111.03 | 15.9 | 45.92 |
| Classification time (s) | 0.243 | 1.388 | 2.38 | 3.787 | 0.06 | 0.04 |
Fig. 4The classification accuracy and AUC of high-dimensional features fused with low-dimensional features.
Fig. 5The iteration diagram of XGBoost.
Fig. 6The classification accuracy and AUC of classification algorithms with/without DBSCAN preprocessing.
Fig. 7The classification accuracy and AUC of classification algorithms under different size ratios between the training set and the test set.
Runtime of classification methods.
| XGBoost | SVM | LR | Optimized AlexNet | |
|---|---|---|---|---|
| HOG-gray extraction (s) | 111.03 | 111.03 | 111.03 | — |
| Image classification (s) | 3.589 | 86.84 | 2.72 | 165.6 |
| Total runtime (s) | 114.62 | 197.87 | 113.75 | 165.6 |
Fig. 8The classification accuracy and AUC of XGBoost under different ratios between the training set size of group 1 and group 2 with/without weight adjusting.
Fig. 9Image libraries of CACO2 and BT474 cells under optofluidic time-stretch microscope.
Comparison of feature extraction methods.
| PCA-gray | LBP-gray | Gabor-gray | HOG-gray | |
|---|---|---|---|---|
| Accuracy | 0.9205 | 0.9048 | 0.9199 | 0.9267 |
| Runtime (s) | 5.39 | 7.51 | 109.71 | 14.52 |
Comparison of classification methods.
| Accuracy with DBSCAN | Accuracy without DBSCAN | Runtime of model classification (s) | Total runtime (s) | |
|---|---|---|---|---|
| XGBoost | 0.9267 | 0.9114 | 0.406 | 14.93 |
| SVM | 0.8471 | 0.8289 | 14.08 | 28.60 |
| LR | 0.8739 | 0.8524 | 0.615 | 15.14 |
| Optimized AlexNet | 0.9239 | 0.9090 | 31.78 | 31.78 |