| Literature DB >> 26997530 |
Mohammed Ghesmoune1, Mustapha Lebbah2, Hanene Azzag3.
Abstract
Clustering data streams is becoming the most efficient way to cluster a massive dataset. This task requires a process capable of partitioning observations continuously with restrictions of memory and time. In this paper we present a new algorithm, called G-Stream, for clustering data streams by making one pass over the data. G-Stream is based on growing neural gas, that allows us to discover clusters of arbitrary shapes without any assumptions on the number of clusters. By using a reservoir, and applying a fading function, the quality of clustering is improved. The performance of the proposed algorithm is evaluated on public datasets.Keywords: Data stream clustering; GNG; Topological structure
Mesh:
Year: 2016 PMID: 26997530 DOI: 10.1016/j.neunet.2016.02.003
Source DB: PubMed Journal: Neural Netw ISSN: 0893-6080