Literature DB >> 24501719

Towards Building a High Performance Spatial Query System for Large Scale Medical Imaging Data.

Ablimit Aji1, Fusheng Wang2, Joel H Saltz3.   

Abstract

Support of high performance queries on large volumes of scientific spatial data is becoming increasingly important in many applications. This growth is driven by not only geospatial problems in numerous fields, but also emerging scientific applications that are increasingly data- and compute-intensive. For example, digital pathology imaging has become an emerging field during the past decade, where examination of high resolution images of human tissue specimens enables more effective diagnosis, prediction and treatment of diseases. Systematic analysis of large-scale pathology images generates tremendous amounts of spatially derived quantifications of micro-anatomic objects, such as nuclei, blood vessels, and tissue regions. Analytical pathology imaging provides high potential to support image based computer aided diagnosis. One major requirement for this is effective querying of such enormous amount of data with fast response, which is faced with two major challenges: the "big data" challenge and the high computation complexity. In this paper, we present our work towards building a high performance spatial query system for querying massive spatial data on MapReduce. Our framework takes an on demand index building approach for processing spatial queries and a partition-merge approach for building parallel spatial query pipelines, which fits nicely with the computing model of MapReduce. We demonstrate our framework on supporting multi-way spatial joins for algorithm evaluation and nearest neighbor queries for microanatomic objects. To reduce query response time, we propose cost based query optimization to mitigate the effect of data skew. Our experiments show that the framework can efficiently support complex analytical spatial queries on MapReduce.

Entities:  

Keywords:  Data Skew; Design; Experimentation; Management; MapReduce; Pathology Imaging; Performance; Spatial Query Processing

Year:  2012        PMID: 24501719      PMCID: PMC3909999          DOI: 10.1145/2424321.2424361

Source DB:  PubMed          Journal:  Proc ACM SIGSPATIAL Int Conf Adv Inf


  6 in total

1.  The virtual microscope.

Authors:  Umit Catalyürek; Michael D Beynon; Chialin Chang; Tahsin Kurc; Alan Sussman; Joel Saltz
Journal:  IEEE Trans Inf Technol Biomed       Date:  2003-12

2.  Integrative, multimodal analysis of glioblastoma using TCGA molecular data, pathology images, and clinical outcomes.

Authors:  Jun Kong; Lee A D Cooper; Fusheng Wang; David A Gutman; Jingjing Gao; Candace Chisolm; Ashish Sharma; Tony Pan; Erwin G Van Meir; Tahsin M Kurc; Carlos S Moreno; Joel H Saltz; Daniel J Brat
Journal:  IEEE Trans Biomed Eng       Date:  2011-09-23       Impact factor: 4.538

3.  The Open Microscopy Environment (OME) Data Model and XML file: open tools for informatics and quantitative analysis in biological imaging.

Authors:  Ilya G Goldberg; Chris Allan; Jean-Marie Burel; Doug Creager; Andrea Falconi; Harry Hochheiser; Josiah Johnston; Jeff Mellen; Peter K Sorger; Jason R Swedlow
Journal:  Genome Biol       Date:  2005-05-03       Impact factor: 13.583

4.  Accelerating Pathology Image Data Cross-Comparison on CPU-GPU Hybrid Systems.

Authors:  Kaibo Wang; Yin Huai; Rubao Lee; Fusheng Wang; Xiaodong Zhang; Joel H Saltz
Journal:  Proceedings VLDB Endowment       Date:  2012-07

5.  Pseudopalisades in glioblastoma are hypoxic, express extracellular matrix proteases, and are formed by an actively migrating cell population.

Authors:  Daniel J Brat; Amilcar A Castellano-Sanchez; Stephen B Hunter; Marcia Pecot; Cynthia Cohen; Elizabeth H Hammond; Sarojini N Devi; Balveen Kaur; Erwin G Van Meir
Journal:  Cancer Res       Date:  2004-02-01       Impact factor: 12.701

6.  Integrated morphologic analysis for the identification and characterization of disease subtypes.

Authors:  Lee A D Cooper; Jun Kong; David A Gutman; Fusheng Wang; Jingjing Gao; Christina Appin; Sharath Cholleti; Tony Pan; Ashish Sharma; Lisa Scarpace; Tom Mikkelsen; Tahsin Kurc; Carlos S Moreno; Daniel J Brat; Joel H Saltz
Journal:  J Am Med Inform Assoc       Date:  2012-01-24       Impact factor: 4.497

  6 in total
  5 in total

1.  MaReIA: A Cloud MapReduce Based High Performance Whole Slide Image Analysis Framework.

Authors:  Hoang Vo; Jun Kong; Dejun Teng; Yanhui Liang; Ablimit Aji; George Teodoro; Fusheng Wang
Journal:  Distrib Parallel Databases       Date:  2018-07-30       Impact factor: 1.500

2.  Hadoop-GIS: A High Performance Spatial Data Warehousing System over MapReduce.

Authors:  Ablimit Aji; Fusheng Wang; Hoang Vo; Rubao Lee; Qiaoling Liu; Xiaodong Zhang; Joel Saltz
Journal:  Proceedings VLDB Endowment       Date:  2013-08

3.  Demonstration of Hadoop-GIS: A Spatial Data Warehousing System Over MapReduce.

Authors:  Ablimit Aji; Xiling Sun; Hoang Vo; Qioaling Liu; Rubao Lee; Xiaodong Zhang; Joel Saltz; Fusheng Wang
Journal:  Proc ACM SIGSPATIAL Int Conf Adv Inf       Date:  2013-11

Review 4.  Toward a Literature-Driven Definition of Big Data in Healthcare.

Authors:  Emilie Baro; Samuel Degoul; Régis Beuscart; Emmanuel Chazard
Journal:  Biomed Res Int       Date:  2015-06-02       Impact factor: 3.411

5.  Querying and Extracting Timeline Information from Road Traffic Sensor Data.

Authors:  Ardi Imawan; Fitri Indra Indikawati; Joonho Kwon; Praveen Rao
Journal:  Sensors (Basel)       Date:  2016-08-23       Impact factor: 3.576

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.