A PID-Based kNN Query Processing Algorithm for Spatial Data.

Baiyou Qiao, Ling Ma, Linlin Chen, Bing Hu.

Abstract

The k-Nearest Neighbors (kNN) query is a popular spatial operation that is widely used in spatial application systems, and processing kNN queries efficiently on spatial big data remains an important research topic in spatial data management. Centralized solutions are not suitable for spatial big data because they scale poorly, while existing distributed solutions are not efficient enough to meet the strict real-time requirements of some spatial applications. We therefore introduce Proportional Integral Derivative (PID) control into kNN query processing and propose a PID-based kNN query processing algorithm (PIDKNN) for spatial big data, built on Spark. In this algorithm, the whole data space is divided into equal-sized grid cells using the grid partition method, and a grid-based index is constructed. On this basis, a grid-based density peak clustering algorithm is used to cluster the spatial data, and PID parameters are set for each cluster. When a kNN query is performed, the PID algorithm estimates the growth step of the query radius, so the query radius grows by a variable step driven by a feedback mechanism, which greatly improves the efficiency of kNN query processing. Experimental results show that the PIDKNN algorithm performs and scales well and outperforms existing parallel kNN query processing methods.
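The feedback idea in the abstract can be illustrated with a small single-machine sketch. It is not the authors' PIDKNN implementation: the abstract gives no formulas, so the error signal (the fraction of the k neighbors still missing), the gain values, the minimum step, and the names `GridIndex` and `pid_knn` are all assumptions made for illustration. Spark and density peak clustering are omitted, so the gains are passed in directly rather than looked up per cluster.

```python
import math
from collections import defaultdict


class GridIndex:
    """Uniform grid over 2-D points; each cell lists the points inside it."""

    def __init__(self, points, cell_size):
        self.cell_size = cell_size
        self.cells = defaultdict(list)
        for p in points:
            self.cells[self._cell(p)].append(p)

    def _cell(self, p):
        return (math.floor(p[0] / self.cell_size),
                math.floor(p[1] / self.cell_size))

    def within(self, q, radius):
        """Points within `radius` of q, scanning only the overlapping cells."""
        cx0, cy0 = self._cell((q[0] - radius, q[1] - radius))
        cx1, cy1 = self._cell((q[0] + radius, q[1] + radius))
        hits = []
        for cx in range(cx0, cx1 + 1):
            for cy in range(cy0, cy1 + 1):
                for p in self.cells.get((cx, cy), ()):
                    if math.dist(p, q) <= radius:
                        hits.append(p)
        return hits


def pid_knn(index, q, k, kp=0.5, ki=0.1, kd=0.05, max_iter=100):
    """kNN search whose radius grows by a PID-controlled step.

    Assumed error signal: the fraction of the k neighbors still missing.
    The gains kp, ki, kd are illustrative defaults, not values from the paper.
    """
    radius = index.cell_size          # start with roughly one cell
    integral = 0.0
    prev_error = 0.0
    for _ in range(max_iter):
        hits = index.within(q, radius)
        if len(hits) >= k:
            # At least k points lie inside the circle, so the k nearest
            # overall are guaranteed to be among them.
            hits.sort(key=lambda p: math.dist(p, q))
            return hits[:k]
        error = (k - len(hits)) / k
        integral += error
        derivative = error - prev_error
        prev_error = error
        step = (kp * error + ki * integral + kd * derivative) * index.cell_size
        # Assumed floor on the step so the radius always increases.
        radius += max(step, 0.1 * index.cell_size)
    raise ValueError("fewer than k points found within max_iter expansions")
```

In this sketch a large shortfall produces a large step and a small shortfall a small one, which is the variable growth step the abstract describes; a fixed-step search would correspond to ignoring the error and always adding the same increment.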

Keywords:  PID; Spark; density peak clustering; kNN query; spatial big data

Year: 2022   PMID: 36236746   PMCID: PMC9572315   DOI: 10.3390/s22197651

Source DB: PubMed   Journal: Sensors (Basel)   ISSN: 1424-8220   Impact factor: 3.847


  2 in total

1.  SparkGIS: Resource Aware Efficient In-Memory Spatial Query Processing.

Authors:  Furqan Baig; Hoang Vo; Tahsin Kurc; Joel Saltz; Fusheng Wang
Journal:  Proc ACM SIGSPATIAL Int Conf Adv Inf       Date:  2017-11

2.  LocationSpark: In-memory Distributed Spatial Query Processing and Optimization.

Authors:  Mingjie Tang; Yongyang Yu; Ahmed R Mahmood; Qutaibah M Malluhi; Mourad Ouzzani; Walid G Aref
Journal:  Front Big Data       Date:  2020-10-16