Literature DB >> 25264418

Computing Spatial Distance Histograms for Large Scientific Datasets On-the-Fly.

Anand Kumar1, Vladimir Grupcev1, Yongke Yuan2, Jin Huang3, Yi-Cheng Tu1, Gang Shen4.   

Abstract

This paper focuses on an important query in scientific simulation data analysis: the Spatial Distance Histogram (SDH). The computation time of an SDH query using brute force method is quadratic. Often, such queries are executed continuously over certain time periods, increasing the computation time. We propose highly efficient approximate algorithm to compute SDH over consecutive time periods with provable error bounds. The key idea of our algorithm is to derive statistical distribution of distances from the spatial and temporal characteristics of particles. Upon organizing the data into a Quad-tree based structure, the spatiotemporal characteristics of particles in each node of the tree are acquired to determine the particles' spatial distribution as well as their temporal locality in consecutive time periods. We report our efforts in implementing and optimizing the above algorithm in Graphics Processing Units (GPUs) as means to further improve the efficiency. The accuracy and efficiency of the proposed algorithm is backed by mathematical analysis and results of extensive experiments using data generated from real simulation studies.

Entities:  

Keywords:  GPU; Scientific databases; density map; quad-tree; spatial distance histogram; spatiotemporal locality

Year:  2014        PMID: 25264418      PMCID: PMC4172330          DOI: 10.1109/TKDE.2014.2298015

Source DB:  PubMed          Journal:  IEEE Trans Knowl Data Eng        ISSN: 1041-4347            Impact factor:   6.977


  4 in total

1.  GROMACS 4:  Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation.

Authors:  Berk Hess; Carsten Kutzner; David van der Spoel; Erik Lindahl
Journal:  J Chem Theory Comput       Date:  2008-03       Impact factor: 6.006

2.  Performance analysis of a dual-tree algorithm for computing spatial distance histograms.

Authors:  Shaoping Chen; Yi-Cheng Tu; Yuni Xia
Journal:  VLDB J       Date:  2011-08-01       Impact factor: 2.868

3.  Approximate Algorithms for Computing Spatial Distance Histograms with Accuracy Guarantees.

Authors:  Vladimir Grupcev; Yongke Yuan; Yi-Cheng Tu; Jin Huang; Shaoping Chen; Sagar Pandit; Michael Weng
Journal:  IEEE Trans Knowl Data Eng       Date:  2012-09-01       Impact factor: 6.977

4.  Distance Histogram Computation Based on Spatiotemporal Uniformity in Scientific Data.

Authors:  Anand Kumar; Vladimir Grupcev; Yongke Yuan; Yi-Cheng Tu; Gang Shen
Journal:  Adv Database Technol       Date:  2012
  4 in total
  1 in total

1.  DCMS: A data analytics and management system for molecular simulation.

Authors:  Anand Kumar; Vladimir Grupcev; Meryem Berrada; Joseph C Fogarty; Yi-Cheng Tu; Xingquan Zhu; Sagar A Pandit; Yuni Xia
Journal:  J Big Data       Date:  2014-11-26
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.