| Literature DB >> 28085937 |
Shaoming Pan1, Yanwen Chong1, Hang Zhang2, Xicheng Tan3,4.
Abstract
A web geographical information system is a typical service-intensive application. Tile prefetching and cache replacement can improve cache hit ratios by proactively fetching tiles from storage and replacing the appropriate tiles from the high-speed cache buffer without waiting for a client's requests, which reduces disk latency and improves system access performance. Most popular prefetching strategies consider only the relative tile popularities to predict which tile should be prefetched or consider only a single individual user's access behavior to determine which neighbor tiles need to be prefetched. Some studies show that comprehensively considering all users' access behaviors and all tiles' relationships in the prediction process can achieve more significant improvements. Thus, this work proposes a new global user-driven model for tile prefetching and cache replacement. First, based on all users' access behaviors, a type of expression method for tile correlation is designed and implemented. Then, a conditional prefetching probability can be computed based on the proposed correlation expression mode. Thus, some tiles to be prefetched can be found by computing and comparing the conditional prefetching probability from the uncached tiles set and, similarly, some replacement tiles can be found in the cache buffer according to multi-step prefetching. Finally, some experiments are provided comparing the proposed model with other global user-driven models, other single user-driven models, and other client-side prefetching strategies. The results show that the proposed model can achieve a prefetching hit rate in approximately 10.6% ~ 110.5% higher than the compared methods.Entities:
Year: 2017 PMID: 28085937 PMCID: PMC5234825 DOI: 10.1371/journal.pone.0170195
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
An example of using different conditions to make predictions.
| Conditions | Prediction |
|---|---|
| The last requested tile ( | The best choice is |
| The last two tiles (× | The best choice is |
| The last three tiles (×× | The best choice is |
| The last four tiles (××× | The best choice is |
* One or more tiles requested together are considered as a type of fixed access mode.
**Complex conditions can result in a more rigorous and more accurate forecast.
The datasets parameters used in the experiments.
| Parameters | Value |
|---|---|
| Dataset name | SRTM90 |
| Data size | 55242 ~ 663552 |
| Number of trace | 2 for each data size |
| Trace size | 2 million ~ 20 million |
*Based on the Zipf-like laws, only subsets of the hotspot tiles will be requested.
**The traces record only the labels of requested tiles in chronological order.
Fig 1Comparative CHRs obtained from different prefetching algorithms based on multi-user mode.
Fig 2Comparative CHRs obtained from different prefetching algorithms based on single-user mode.
A performance comparison of various algorithms based on the client-side prefetching mode.
| Algorithms | Average cache hit ratio (CHR) (%) | Cache space (tiles) | CHR based on normalized cache space (1 tile) (%) |
|---|---|---|---|
| GUDC | 56.55% | 1000 | 0.05655 |
| DCST | 50.22% | 1000 | 0.05022 |
| ZM | 47.83% | 1000 | 0.04783 |
| ZL | 34.85% | 1000 | 0.03485 |
| BM | 25.08% | 1000 | 0.02508 |
| RAP | 50.33% | 984 | 0.05115 |
| PKM | 29.41% | 1041 | 0.02825 |
Fig 3Comparative CHRs obtained from GUDC based on different global user behaviors.
Fig 4Comparative CHRs obtained from GUDC based on different data sizes.
Fig 5Comparative CHRs obtained from GUDC using different prefetching steps.
Fig 6Comparative CHRs obtained from GUDC using different cache replacement strategies.
Comparative CHRs and disk access ratio based on different matching radius.
| Matching radius ( | Disk access ratio (%) | Average cache hit ratio (CHR) (%) |
|---|---|---|
| 1 | 94.77% | 48.27% |
| 3 | 86.92% | 48.36% |
| 5 | 85.23% | 48.56% |
| 14 | 89.62% | 48.39% |
Fig 7Comparative disk access performance obtained from different algorithms.