Literature DB >> 22347787

Compute-unified device architecture implementation of a block-matching algorithm for multiple graphical processing unit cards.

Francesc Massanes1, Marie Cadennes, Jovan G Brankov.   

Abstract

In this paper we describe and evaluate a fast implementation of a classical block matching motion estimation algorithm for multiple Graphical Processing Units (GPUs) using the Compute Unified Device Architecture (CUDA) computing engine. The implemented block matching algorithm (BMA) uses summed absolute difference (SAD) error criterion and full grid search (FS) for finding optimal block displacement. In this evaluation we compared the execution time of a GPU and CPU implementation for images of various sizes, using integer and non-integer search grids.The results show that use of a GPU card can shorten computation time by a factor of 200 times for integer and 1000 times for a non-integer search grid. The additional speedup for non-integer search grid comes from the fact that GPU has built-in hardware for image interpolation. Further, when using multiple GPU cards, the presented evaluation shows the importance of the data splitting method across multiple cards, but an almost linear speedup with a number of cards is achievable.In addition we compared execution time of the proposed FS GPU implementation with two existing, highly optimized non-full grid search CPU based motion estimations methods, namely implementation of the Pyramidal Lucas Kanade Optical flow algorithm in OpenCV and Simplified Unsymmetrical multi-Hexagon search in H.264/AVC standard. In these comparisons, FS GPU implementation still showed modest improvement even though the computational complexity of FS GPU implementation is substantially higher than non-FS CPU implementation.We also demonstrated that for an image sequence of 720×480 pixels in resolution, commonly used in video surveillance, the proposed GPU implementation is sufficiently fast for real-time motion estimation at 30 frames-per-second using two NVIDIA C1060 Tesla GPU cards.

Entities:  

Year:  2011        PMID: 22347787      PMCID: PMC3280822          DOI: 10.1117/1.3606588

Source DB:  PubMed          Journal:  J Electron Imaging        ISSN: 1017-9909            Impact factor:   0.945


  3 in total

1.  Deformable left-ventricle mesh model for motion-compensated filtering in cardiac gated SPECT.

Authors:  Thibault Marin; Jovan G Brankov
Journal:  Med Phys       Date:  2010-10       Impact factor: 4.071

2.  Motion compensation in digital subtraction angiography using graphics hardware.

Authors:  Yu Deuerling-Zheng; Michael Lell; Adam Galant; Joachim Hornegger
Journal:  Comput Med Imaging Graph       Date:  2006-08-14       Impact factor: 4.790

3.  Simplified electroholographic color reconstruction system using graphics processing unit and liquid crystal display projector.

Authors:  Atsushi Shiraki; Naoki Takada; Masashi Niwa; Yasuyuki Ichihashi; Tomoyoshi Shimobaba; Nobuyuki Masuda; Tomoyoshi Ito
Journal:  Opt Express       Date:  2009-08-31       Impact factor: 3.894

  3 in total
  2 in total

1.  Computing global minimizers to a constrained B-spline image registration problem from optimal l1 perturbations to block match data.

Authors:  Edward Castillo; Richard Castillo; David Fuentes; Thomas Guerrero
Journal:  Med Phys       Date:  2014-04       Impact factor: 4.071

2.  Block Matching Pyramid Algorithm-Based Analysis on Efficacy of Shexiang Baoxin Pills Guided by Echocardiogram (ECG) on Patients with Angina Pectoris in Coronary Heart Disease.

Authors:  Junqing Gao; Xu Wang; Lingyan Li; Hong Zhang; Ruiqing He; Bingyu Han; Zongjun Li
Journal:  J Healthc Eng       Date:  2021-08-06       Impact factor: 2.682

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.