Literature DB >> 24187650

Hadoop-GIS: A High Performance Spatial Data Warehousing System over MapReduce.

Ablimit Aji1, Fusheng Wang, Hoang Vo, Rubao Lee, Qiaoling Liu, Xiaodong Zhang, Joel Saltz.   

Abstract

Support of high performance queries on large volumes of spatial data becomes increasingly important in many application domains, including geospatial problems in numerous fields, location based services, and emerging scientific applications that are increasingly data- and compute-intensive. The emergence of massive scale spatial data is due to the proliferation of cost effective and ubiquitous positioning technologies, development of high resolution imaging technologies, and contribution from a large number of community users. There are two major challenges for managing and querying massive spatial data to support spatial queries: the explosion of spatial data, and the high computational complexity of spatial queries. In this paper, we present Hadoop-GIS - a scalable and high performance spatial data warehousing system for running large scale spatial queries on Hadoop. Hadoop-GIS supports multiple types of spatial queries on MapReduce through spatial partitioning, customizable spatial query engine RESQUE, implicit parallel spatial query execution on MapReduce, and effective methods for amending query results through handling boundary objects. Hadoop-GIS utilizes global partition indexing and customizable on demand local spatial indexing to achieve efficient query processing. Hadoop-GIS is integrated into Hive to support declarative spatial queries with an integrated architecture. Our experiments have demonstrated the high efficiency of Hadoop-GIS on query response and high scalability to run on commodity clusters. Our comparative experiments have showed that performance of Hadoop-GIS is on par with parallel SDBMS and outperforms SDBMS for compute-intensive queries. Hadoop-GIS is available as a set of library for processing spatial queries, and as an integrated software package in Hive.

Entities:  

Year:  2013        PMID: 24187650      PMCID: PMC3814183     

Source DB:  PubMed          Journal:  Proceedings VLDB Endowment        ISSN: 2150-8097


  4 in total

1.  Towards Building a High Performance Spatial Query System for Large Scale Medical Imaging Data.

Authors:  Ablimit Aji; Fusheng Wang; Joel H Saltz
Journal:  Proc ACM SIGSPATIAL Int Conf Adv Inf       Date:  2012-11-06

2.  Accelerating Pathology Image Data Cross-Comparison on CPU-GPU Hybrid Systems.

Authors:  Kaibo Wang; Yin Huai; Rubao Lee; Fusheng Wang; Xiaodong Zhang; Joel H Saltz
Journal:  Proceedings VLDB Endowment       Date:  2012-07

3.  A data model and database for high-resolution pathology analytical image informatics.

Authors:  Fusheng Wang; Jun Kong; Lee Cooper; Tony Pan; Tahsin Kurc; Wenjin Chen; Ashish Sharma; Cristobal Niedermayr; Tae W Oh; Daniel Brat; Alton B Farris; David J Foran; Joel Saltz
Journal:  J Pathol Inform       Date:  2011-07-26

4.  A high-performance spatial database based approach for pathology imaging algorithm evaluation.

Authors:  Fusheng Wang; Jun Kong; Jingjing Gao; Lee A D Cooper; Tahsin Kurc; Zhengwen Zhou; David Adler; Cristobal Vergara-Niedermayr; Bryan Katigbak; Daniel J Brat; Joel H Saltz
Journal:  J Pathol Inform       Date:  2013-03-14
  4 in total
  20 in total

1.  iSPEED: an Efficient In-Memory Based Spatial Query System for Large-Scale 3D Data with Complex Structures.

Authors:  Yanhui Liang; Jun Kong; Hoang Vo; Fusheng Wang
Journal:  Proc ACM SIGSPATIAL Int Conf Adv Inf       Date:  2017-11

2.  Scalable 3D Spatial Queries for Analytical Pathology Imaging with MapReduce.

Authors:  Yanhui Liang; Hoang Vo; Ablimit Aji; Jun Kong; Fusheng Wang
Journal:  Proc ACM SIGSPATIAL Int Conf Adv Inf       Date:  2016 Oct-Nov

3.  Integrative Spatial Data Analytics for Public Health Studies of New York State.

Authors:  Xin Chen; Fusheng Wang
Journal:  AMIA Annu Symp Proc       Date:  2017-02-10

4.  Safe "cloudification" of large images through picker APIs.

Authors:  Erich Bremer; Tahsin Kurc; Yi Gao; Joel Saltz; Jonas S Almeida
Journal:  AMIA Annu Symp Proc       Date:  2017-02-10

5.  Multi-objective Parameter Auto-tuning for Tissue Image Segmentation Workflows.

Authors:  Luis F R Taveira; Tahsin Kurc; Alba C M A Melo; Jun Kong; Erich Bremer; Joel H Saltz; George Teodoro
Journal:  J Digit Imaging       Date:  2019-06       Impact factor: 4.056

6.  SparkGIS: Efficient Comparison and Evaluation of Algorithm Results in Tissue Image Analysis Studies.

Authors:  Furqan Baig; Mudit Mehrotra; Hoang Vo; Fusheng Wang; Joel Saltz; Tahsin Kurc
Journal:  Biomed Data Manag Graph Online Querying (2015)       Date:  2016-06-24

7.  Parallel Versus Distributed Data Access for Gigapixel-Resolution Histology Images: Challenges and Opportunities.

Authors:  Esma Yildirim; David J Foran
Journal:  IEEE J Biomed Health Inform       Date:  2016-06-13       Impact factor: 5.772

8.  Medical Big Data Warehouse: Architecture and System Design, a Case Study: Improving Healthcare Resources Distribution.

Authors:  Abderrazak Sebaa; Fatima Chikh; Amina Nouicer; AbdelKamel Tari
Journal:  J Med Syst       Date:  2018-02-19       Impact factor: 4.460

9.  MaReIA: A Cloud MapReduce Based High Performance Whole Slide Image Analysis Framework.

Authors:  Hoang Vo; Jun Kong; Dejun Teng; Yanhui Liang; Ablimit Aji; George Teodoro; Fusheng Wang
Journal:  Distrib Parallel Databases       Date:  2018-07-30       Impact factor: 1.500

10.  SparkGIS: Resource Aware Efficient In-Memory Spatial Query Processing.

Authors:  Furqan Baig; Hoang Vo; Tahsin Kurc; Joel Saltz; Fusheng Wang
Journal:  Proc ACM SIGSPATIAL Int Conf Adv Inf       Date:  2017-11
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.