| Literature DB >> 30213126 |
Yue Wang1, Xiaochen Meng2, Lianqing Zhu3.
Abstract
The increased volume and complexity of flow cytometry (FCM) data resulting from the increased throughput greatly boosts the demand for reliable statistical methods for the analysis of multidimensional data. The Support Vector Machines (SVM) model can be used for classification recognition. However, the selection of penalty factor c and kernel parameter g in the model has a great influence on the correctness of clustering. To solve the problem of parameter optimization of the SVM model, a support vector machine algorithm of particle swarm optimization (PSO-SVM) based on adaptive mutation is proposed. Firstly, a large number of FCM data were used to carry out the experiment, and the kernel function adapted to the sample data was selected. Then the PSO algorithm of adaptive mutation was used to optimize the parameters of the SVM classifier. Finally, the cell clustering results were obtained. The method greatly improves the clustering correctness of traditional SVM. That also overcomes the shortcomings of PSO algorithm, which is easy to fall into local optimum in the iterative optimization process and has poor convergence effect in dealing with a large number of data. Compared with the traditional SVM algorithm, the experimental results show that, the correctness of the method is improved by 19.38%. Compared with the cross-validation algorithm and the PSO algorithm, the adaptive mutation PSO algorithm can also improve the correctness of FCM data clustering. The correctness of the algorithm can reach 99.79% and the time complexity is relatively lower. At the same time, the method does not need manual intervention, which promotes the research of cell group identification in biomedical detection technology.Entities:
Keywords: adaptive mutation PSO-SVM; biomedicine; cell clustering; flow cytometry; fluorescent reagent; supervised clustering
Year: 2018 PMID: 30213126 PMCID: PMC6162506 DOI: 10.3390/cells7090135
Source DB: PubMed Journal: Cells ISSN: 2073-4409 Impact factor: 6.600
Figure 1Generation principle of flow pulse signal.
Figure 2Support vector machine classification schematic diagram.
Figure 3Flow chart of PSO algorithm for adaptive mutation.
Figure 4Optimal individual fitness (a) PSO algorithm; (b) Adaptive mutation PSO algorithm.
Figure 5Experimental control group (a) Cell staining strategy; (b) Artificial clustering results.
Kernel function selection (c-1, g-0.7).
| Group | Linear Kernel | Polynomial Kernel | RBF Kernel | Sigmod Kernel | |||||
|---|---|---|---|---|---|---|---|---|---|
| Correctness % | SV | Correctness % | SV | Correctness % | SV | Correctness % | SV | ||
| 400 | 79.4118 (108/136) | 68 | 85.2941 (116/136) | 69 | 42.6471 (58/136) | 400 | 41.9118 (57/136) | 338 | |
| 800 | 77.2189 (261/338) | 132 | 77.2189 (261/338) | 132 | 34.0237 (115/338) | 800 | 34.0237 (115/338) | 670 | |
| 1200 | 75.25 (301/400) | 155 | 81 (324/400) | 195 | 41.75 (167/400) | 1200 | 41.75 (167/400) | 1015 | |
| 1600 | 72.8464 (389/534) | 200 | 83.1461 (444/534) | 237 | 47.191 (252/534) | 1600 | 43.4457 (232/534) | 1336 | |
| 2000 | 80.3598 (536/667) | 237 | 79.1604 (528/667) | 284 | 43.4783 (290/667) | 2000 | 43.4783 (290/667) | 1670 | |
| 2400 | 78.125 (625/800) | 245 | 78.75 (630/800) | 342 | 43.375 (347/800) | 2400 | 43.375 (347/800) | 2005 | |
| 2800 | 75.0268 (700/933) | 302 | 78.135 (729/933) | 359 | 43.4084 (405/933) | 2800 | 43.4084 (405/933) | 2338 | |
| 3200 | 78.2364 (834/1066) | 328 | 80.7692 (861/1066) | 407 | 43.4334 (463/1066) | 3200 | 43.4334 (463/1066) | 2671 | |
| 3600 | 77.0642 (924/1199) | 330 | 78.3153 (939/1199) | 428 | 43.4529 (521/1199) | 3420 | 43.4529 (521/1199) | 2649 | |
Figure 6The correctness of four kernel functions.
Figure 7Number of support vectors for four kernel functions.
Kernel function selection evaluation index.
| Kernel | Linear | Polynomial | RBF | Sigmod |
|---|---|---|---|---|
| Average Correctness % | 78.4139 | 80.4044 | 42.1814 | 42.0310 |
| Average SV | 212 | 267 | 1980 | 1632 |
Comparison of evaluation indexes between polynomial and RBF kernel function.
| Order | Kernel |
|
| CVAcc % | Correctness % | nSVtotal | t/s |
|---|---|---|---|---|---|---|---|
| 1 | Polynomial | 74.4406 | 0.1870 | 99.3219 | 99.1192 | 237 | 497.8755 |
| 2 | RBF | 74.5659 | 0.1412 | 99.3174 | 99.1848 | 250 | 589.0584 |
Figure 8Clustering results of human peripheral blood cells (a) Fitness curve; (b) Clustering results.
Figure 9Clustering results of Lymphocyte (a) Fitness curve (b) Clustering results.
Comparison of evaluation index of parameter optimization algorithm.
| Order | Algorithm |
|
| CVAcc % | Correctness % | nSVtotal | t/s |
|---|---|---|---|---|---|---|---|
| 1 | SVM | 1 | 0.7 | - | 80.4044 | 273 | 34.6353 |
| 2 | CV | 11.5648 | 1.3738 | 99.1797 | 99.2015 | 132 | 505.5738 |
| 3 | PSO-SVM | 82.1829 | 0.1412 | 99.3174 | 99.1848 | 250 | 589.0584 |
| 4 | Adaptive mutation PSO-SVM | 74.5659 | 0.01 | 99.6629 | 99.7853 | 249 | 489.5275 |