| Literature DB >> 30557333 |
Yuzhen Zhao1, Xiyu Liu1, Xiufeng Li1.
Abstract
Density-based spatial clustering of applications with noise (DBSCAN) algorithm can find clusters of arbitrary shape, while the noise points can be removed. Membrane computing is a novel research branch of bio-inspired computing, which seeks to discover new computational models/framework from biological cells. The obtained parallel and distributed computing models are usually called P systems. In this work, DBSCAN algorithm is improved by using parallel evolution mechanism and hierarchical membrane structure in cell-like P systems with promoters and inhibitors, where promoters and inhibitors are utilized to regulate parallelism of objects evolution. Experiment results show that the proposed algorithm performs well in big cluster analysis. The time complexity is improved to O(n), in comparison with conventional DBSCAN of O(n2). The results give some hints to improve conventional algorithms by using the hierarchical framework and parallel evolution mechanism in membrane computing models.Entities:
Mesh:
Year: 2018 PMID: 30557333 PMCID: PMC6296794 DOI: 10.1371/journal.pone.0200751
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Membrane structure for the improved DBSCAN algorith.
The computational process of the example.
| step | membrane 0 | membrane 1 | membrane 3 | membrane 6 |
|---|---|---|---|---|
| 0 | ||||
| 1 |
| |||
| 2 |
| |||
| 4 |
| |||
| 5 | ||||
| 6 | ||||
| 7 | ||||
| 8 | ||||
| 9 | ||||
| 10 | ||||
| 11 | ||||
| 12 | ||||
| 13 | ||||
| 14 | ||||
| 15 | ||||
| 16 | ||||
| 17 | ||||
| 18 | ||||
| 19 |
Comparisons results of time complexity of some proposed DBSCAN algorithms.
| algorithm | time complexity |
|---|---|
| DBSCAN [ | |
| Rough-DBSCAN [ | |
| DBSCAN using a pruning technique on bit vectors [ | |
| A prototype-based modified DBSCAN [ | max{ |
| G-DBSCAN [ | |
| BDE-DBSCAN [ | |
| SS-DBSCAN [ | |
| DBSCAN based on grid cell [ | |
| DBSCAN with Spark [ | |
Fig 2The data points waiting for being clustered.
Fig 3The two clusters formed by the conventional algorithm.
The 3 clusters and noise points on Iris database using DBSCAN-CPPI algorith.
| Cluster | Serial number of data in the corresponding cluster |
|---|---|
| 1 | 1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,43,44,45,46,47,48,49,50 |
| 2 | 51,52,53,54,55,56,57,59,60,62,64,66,67,68,70,72,74,75,76,77,78,79,80,81,82,83,85,86,87,89,90,91,92,93,95,96,97,98,100 |
| 3 | 71,73,84,102,103,104,105,111,112,113,114,116,117,121,122,124,125,126,127,128,129,130,133,134,137,138,139,140,141,142,143,144,145,146,147,148,149,150 |
| Noise points | 23,42,58,61,63,65,69,88,94,99,101,106,107,108,109,110, 115,118,119,120,123,131,132,135,136 |
Fig 4The banana shaped database.
Fig 5The 2 clusters and noise points with DBSCAN algorithm.
Fig 6The cluster accuracy of different parameter values in the Iris database obtained by DBSCAN-CPPI.
Fig 7The cluster accuracy of different parameter values in the banana database obtained by DBSCAN-CPPI.
The 3 clusters with k-means algorithm.
| Cluster | Serial number of data in the corresponding cluster |
|---|---|
| 1 | 1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50 |
| 2 | 51,52,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94,95,96,97,98,99,100,102,107,114,115,120,122,124,127,128,134,139, 143,147,150 |
| 3 | 53,78,101,103,104,105,106,108,109,110,111,112,113,116,117,118,119,121,123,125,126,129,130,131,132,133,135,136,137,138,140,141,142,144,145,146,148,149 |
Fig 8The 2 clusters with k-means algorithm on banana database.