| Literature DB >> 27617325 |
Ablimit Aji1, Xiling Sun2, Hoang Vo1, Qioaling Liu1, Rubao Lee3, Xiaodong Zhang3, Joel Saltz1, Fusheng Wang1.
Abstract
The proliferation of GPS-enabled devices, and the rapid improvement of scientific instruments have resulted in massive amounts of spatial data in the last decade. Support of high performance spatial queries on large volumes data has become increasingly important in numerous fields, which requires a scalable and efficient spatial data warehousing solution as existing approaches exhibit scalability limitations and efficiency bottlenecks for large scale spatial applications. In this demonstration, we present Hadoop-GIS - a scalable and high performance spatial query system over MapReduce. Hadoop-GIS provides an efficient spatial query engine to process spatial queries, data and space based partitioning, and query pipelines that parallelize queries implicitly on MapReduce. Hadoop-GIS also provides an expressive, SQL-like spatial query language for workload specification. We will demonstrate how spatial queries are expressed in spatially extended SQL queries, and submitted through a command line/web interface for execution. Parallel to our system demonstration, we explain the system architecture and details on how queries are translated to MapReduce operators, optimized, and executed on Hadoop. In addition, we will showcase how the system can be used to support two representative real world use cases: large scale pathology analytical imaging, and geo-spatial data warehousing.Entities:
Keywords: Analytical Imaging; Data Warehouse; Database; Hive; MapReduce; Scientific Data Management; Spatial Query Processing
Year: 2013 PMID: 27617325 PMCID: PMC5013659 DOI: 10.1145/2525314.2525320
Source DB: PubMed Journal: Proc ACM SIGSPATIAL Int Conf Adv Inf