Literature DB >> 24378961

Distance Histogram Computation Based on Spatiotemporal Uniformity in Scientific Data.

Anand Kumar1, Vladimir Grupcev2, Yongke Yuan3, Yi-Cheng Tu, Gang Shen.   

Abstract

Large data generated by scientific applications imposes challenges in storage and efficient query processing. Many queries against scientific data are analytical in nature and require super-linear computation time using straightforward methods. Spatial distance histogram (SDH) is one of the basic queries to analyze the molecular simulation (MS) data, and it takes quadratic time to compute using brute-force approach. Often, an SDH query is executed continuously to analyze the simulation system over a period of time. This adds to the total time required to compute SDH. In this paper, we propose an approximate algorithm to compute SDH efficiently over consecutive time periods. In our approach, data is organized into a Quad-tree based data structure. The spatial locality of the particles (at given time) in each node of the tree is acquired to determine the particle distribution. Similarly, the temporal locality of particles (between consecutive time periods) in each node is also acquired. The spatial distribution and temporal locality are utilized to compute the approximate SDH at every time instant. The performance is boosted by storing and updating the spatial distribution information over time. The efficiency and accuracy of the proposed algorithm is supported by mathematical analysis and results of extensive experiments using biological data generated from real MS studies.

Entities:  

Keywords:  Algorithms; Experimentation; Performance; Scientific data; density map; quad-tree; spatial distance histogram; spatiotemporal locality

Year:  2012        PMID: 24378961      PMCID: PMC3873006          DOI: 10.1145/2247596.2247631

Source DB:  PubMed          Journal:  Adv Database Technol


  3 in total

1.  GROMACS 4:  Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation.

Authors:  Berk Hess; Carsten Kutzner; David van der Spoel; Erik Lindahl
Journal:  J Chem Theory Comput       Date:  2008-03       Impact factor: 6.006

2.  Performance analysis of a dual-tree algorithm for computing spatial distance histograms.

Authors:  Shaoping Chen; Yi-Cheng Tu; Yuni Xia
Journal:  VLDB J       Date:  2011-08-01       Impact factor: 2.868

3.  Distance Histogram Computation Based on Spatiotemporal Uniformity in Scientific Data.

Authors:  Anand Kumar; Vladimir Grupcev; Yongke Yuan; Yi-Cheng Tu; Gang Shen
Journal:  Adv Database Technol       Date:  2012
  3 in total
  5 in total

1.  Computing Spatial Distance Histograms for Large Scientific Datasets On-the-Fly.

Authors:  Anand Kumar; Vladimir Grupcev; Yongke Yuan; Jin Huang; Yi-Cheng Tu; Gang Shen
Journal:  IEEE Trans Knowl Data Eng       Date:  2014-10       Impact factor: 6.977

2.  Efficient SDH Computation In Molecular Simulations Data.

Authors:  Yi-Cheng Tu; Shaoping Chen; Sagar Pandit; Anand Kumar; Vladimir Grupcev
Journal:  ACM BCB       Date:  2012-10

3.  Approximate Algorithms for Computing Spatial Distance Histograms with Accuracy Guarantees.

Authors:  Vladimir Grupcev; Yongke Yuan; Yi-Cheng Tu; Jin Huang; Shaoping Chen; Sagar Pandit; Michael Weng
Journal:  IEEE Trans Knowl Data Eng       Date:  2012-09-01       Impact factor: 6.977

4.  Distance Histogram Computation Based on Spatiotemporal Uniformity in Scientific Data.

Authors:  Anand Kumar; Vladimir Grupcev; Yongke Yuan; Yi-Cheng Tu; Gang Shen
Journal:  Adv Database Technol       Date:  2012

5.  Concurrent query processing in a GPU-based database system.

Authors:  Hao Li; Yi-Cheng Tu; Bo Zeng
Journal:  PLoS One       Date:  2019-04-16       Impact factor: 3.240

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.