| Literature DB >> 30380798 |
Jong Seon Kim1, Yon Dohn Chung2, Jong Wook Kim3.
Abstract
Mobile Crowdsensing (MCS) is a paradigm for collecting large-scale sensor data by leveraging mobile devices equipped with small and low-powered sensors. MCS has recently received considerable attention from diverse fields, because it can reduce the cost incurred in the process of collecting a large amount of sensor data. However, in the task assignment process in MCS, to allocate the requested tasks efficiently, the workers need to send their specific location to the requester, which can raise serious location privacy issues. In this paper, we focus on the methods for publishing differentially a private spatial histogram to guarantee the location privacy of the workers. The private spatial histogram is a sanitized spatial index where each node represents the sub-regions and contains the noisy counts of the objects in each sub-region. With the sanitized spatial histograms, it is possible to estimate approximately the number of workers in the arbitrary area, while preserving their location privacy. However, the existing methods have given little concern to the domain size of the input dataset, leading to the low estimation accuracy. This paper proposes a partitioning technique SAGA (Skew-Aware Grid pArtitioning) based on the hotspots, which is more appropriate to adjust the domain size of the dataset. Further, to optimize the overall errors, we lay a uniform grid in each hotspot. Experimental results on four real-world datasets show that our method provides an enhanced query accuracy compared to the existing methods.Entities:
Keywords: differential privacy; histograms; mobile crowdsensing; spatial databases
Mesh:
Year: 2018 PMID: 30380798 PMCID: PMC6263808 DOI: 10.3390/s18113696
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1A framework for mobile crowdsensing using PSD. MCS, Mobile Crowdsensing.
Figure 4Examples of a spatial domain and various corresponding PSD structures.
Figure 6Hotspot detection using the sliding window mechanism.
Figure 7A noisy hotspot boundary of Figure 6d by Algorithm 1.
Figure 8An example of the remaining regions’ partition.
Figure 9Illustration of datasets.
Information about datasets.
| Dataset | Total Number of Points | Domain Size |
|---|---|---|
|
| 61,391 |
|
|
| 132,088 |
|
|
| 28,532 |
|
|
| 1,325,737 |
|
Parameter configurations for each method.
| Method | Description |
|---|---|
|
| Grid size: |
|
| First grid size: |
|
| Total height: 6, Switching height: 3 |
|
| h-tree size: |
|
| Bias factor: |
s and f values of SAGA used in each dataset ().
| Dataset | Total privacy budget | ||||
|---|---|---|---|---|---|
|
|
|
|
|
| |
|
| 230 | 460 | 690 | 920 | 1151 |
|
| 495 | 990 | 1486 | 1981 | 2476 |
|
| 107 | 214 | 321 | 428 | 535 |
|
| 4971 | 9943 | 14,914 | 19,886 | 24,857 |
Figure 10Average relative error on the Foursquare dataset.
Figure 11Average relative error on the Gowalla dataset.
Figure 12Average relative error on the Tdrive dataset.
Figure 13Average relative error on the TIGER dataset.