Literature DB >> 33796879

Exploiting Computation Reuse for Stencil Accelerators.

Yuze Chi1, Jason Cong1.   

Abstract

Stencil kernel is an important type of kernel used extensively in many application domains. Over the years, researchers have been studying the optimizations on parallelization, communication reuse, and computation reuse for various target platforms. However, challenges still exist, especially on the computation reuse problem for accelerators, due to the lack of complete design-space exploration and effective design-space pruning. In this paper, we present solutions to the above challenges for a wide range of stencil kernels (i.e., stencil with reduction operations), where the computation reuse patterns are extremely flexible due to the commutative and associative properties. We formally define the complete design space, based on which we present a provably optimal dynamic programming algorithm and a heuristic beam search algorithm that provides near-optimal solutions under an architecture-aware model. Experimental results show that for synthesizing stencil kernels to FPGAs, compared with state-of-the-art stencil compiler without computation reuse capability, our proposed algorithm can reduce the look-up table (LUT) and digital signal processor (DSP) usage by 58.1% and 54.6% on average respectively, which leads to an average speedup of 2.3× for compute-intensive kernels, outperforming the latest CPU/GPU results.

Entities:  

Year:  2020        PMID: 33796879      PMCID: PMC8011578          DOI: 10.1109/dac18072.2020.9218680

Source DB:  PubMed          Journal:  Proc Des Autom Conf


  1 in total

1.  Accelerating SSSP for Power-Law Graphs.

Authors:  Yuze Chi; Licheng Guo; Jason Cong
Journal:  FPGA       Date:  2022-02-11
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.