Literature DB >> 26941443

GPU-Accelerated Adjoint Algorithmic Differentiation.

Felix Gremse1, Andreas Höfter2, Lukas Razik1, Fabian Kiessling3, Uwe Naumann2.   

Abstract

Many scientific problems such as classifier training or medical image reconstruction can be expressed as minimization of differentiable real-valued cost functions and solved with iterative gradient-based methods. Adjoint algorithmic differentiation (AAD) enables automated computation of gradients of such cost functions implemented as computer programs. To backpropagate adjoint derivatives, excessive memory is potentially required to store the intermediate partial derivatives on a dedicated data structure, referred to as the "tape". Parallelization is difficult because threads need to synchronize their accesses during taping and backpropagation. This situation is aggravated for many-core architectures, such as Graphics Processing Units (GPUs), because of the large number of light-weight threads and the limited memory size in general as well as per thread. We show how these limitations can be mediated if the cost function is expressed using GPU-accelerated vector and matrix operations which are recognized as intrinsic functions by our AAD software. We compare this approach with naive and vectorized implementations for CPUs. We use four increasingly complex cost functions to evaluate the performance with respect to memory consumption and gradient computation times. Using vectorization, CPU and GPU memory consumption could be substantially reduced compared to the naive reference implementation, in some cases even by an order of complexity. The vectorization allowed usage of optimized parallel libraries during forward and reverse passes which resulted in high speedups for the vectorized CPU version compared to the naive reference implementation. The GPU version achieved an additional speedup of 7.5 ± 4.4, showing that the processing power of GPUs can be utilized for AAD using this concept. Furthermore, we show how this software can be systematically extended for more complex problems such as nonlinear absorption reconstruction for fluorescence-mediated tomography.

Entities:  

Keywords:  Adjoint Algorithmic Differentiation; GPU Programming

Year:  2016        PMID: 26941443      PMCID: PMC4772124          DOI: 10.1016/j.cpc.2015.10.027

Source DB:  PubMed          Journal:  Comput Phys Commun        ISSN: 0010-4655            Impact factor:   4.390


  8 in total

1.  OpenCL: A Parallel Programming Standard for Heterogeneous Computing Systems.

Authors:  John E Stone; David Gohara; Guochun Shi
Journal:  Comput Sci Eng       Date:  2010-05       Impact factor: 2.080

2.  FMT-XCT: in vivo animal studies with hybrid fluorescence molecular tomography-X-ray computed tomography.

Authors:  Angelique Ale; Vladimir Ermolayev; Eva Herzog; Christian Cohrs; Martin Hrabé de Angelis; Vasilis Ntziachristos
Journal:  Nat Methods       Date:  2012-05-06       Impact factor: 28.547

3.  Noninvasive optical imaging of nanomedicine biodistribution.

Authors:  Sijumon Kunjachan; Felix Gremse; Benjamin Theek; Patrick Koczera; Robert Pola; Michal Pechar; Tomas Etrych; Karel Ulbrich; Gert Storm; Fabian Kiessling; Twan Lammers
Journal:  ACS Nano       Date:  2012-10-24       Impact factor: 15.881

Review 4.  Pre-clinical whole-body fluorescence imaging: Review of instruments, methods and applications.

Authors:  Frederic Leblond; Scott C Davis; Pablo A Valdés; Brian W Pogue
Journal:  J Photochem Photobiol B       Date:  2009-11-26       Impact factor: 6.252

5.  Characterizing EPR-mediated passive drug targeting using contrast-enhanced functional ultrasound imaging.

Authors:  Benjamin Theek; Felix Gremse; Sijumon Kunjachan; Stanley Fokong; Robert Pola; Michal Pechar; Roel Deckers; Gert Storm; Josef Ehling; Fabian Kiessling; Twan Lammers
Journal:  J Control Release       Date:  2014-03-12       Impact factor: 9.776

6.  Hybrid µCT-FMT imaging and image analysis.

Authors:  Felix Gremse; Dennis Doleschel; Sara Zafarnia; Anne Babler; Willi Jahnen-Dechent; Twan Lammers; Wiltrud Lederle; Fabian Kiessling
Journal:  J Vis Exp       Date:  2015-06-04       Impact factor: 1.355

7.  Absorption reconstruction improves biodistribution assessment of fluorescent nanoprobes using hybrid fluorescence-mediated tomography.

Authors:  Felix Gremse; Benjamin Theek; Sijumon Kunjachan; Wiltrud Lederle; Alessa Pardo; Stefan Barth; Twan Lammers; Uwe Naumann; Fabian Kiessling
Journal:  Theranostics       Date:  2014-07-26       Impact factor: 11.556

8.  Passive versus active tumor targeting using RGD- and NGR-modified polymeric nanomedicines.

Authors:  Sijumon Kunjachan; Robert Pola; Felix Gremse; Benjamin Theek; Josef Ehling; Diana Moeckel; Benita Hermanns-Sachweh; Michal Pechar; Karel Ulbrich; Wim E Hennink; Gert Storm; Wiltrud Lederle; Fabian Kiessling; Twan Lammers
Journal:  Nano Lett       Date:  2014-01-17       Impact factor: 11.189

  8 in total
  3 in total

1.  Noninvasive Assessment of Elimination and Retention using CT-FMT and Kinetic Whole-body Modeling.

Authors:  Wa'el Al Rawashdeh; Simin Zuo; Andrea Melle; Lia Appold; Susanne Koletnik; Yoanna Tsvetkova; Nataliia Beztsinna; Andrij Pich; Twan Lammers; Fabian Kiessling; Felix Gremse
Journal:  Theranostics       Date:  2017-04-05       Impact factor: 11.556

2.  Intrinsic Respiratory Gating for Simultaneous Multi-Mouse μCT Imaging to Assess Liver Tumors.

Authors:  Mirko Thamm; Stefanie Rosenhain; Kevin Leonardic; Andreas Höfter; Fabian Kiessling; Franz Osl; Thomas Pöschinger; Felix Gremse
Journal:  Front Med (Lausanne)       Date:  2022-07-06

3.  Mixing Matrix-corrected Whole-body Pharmacokinetic Modeling Using Longitudinal Micro-computed Tomography and Fluorescence-mediated Tomography.

Authors:  Simin Zuo; Wa'el Al Rawashdeh; Stefanie Rosenhain; Zuzanna Magnuska; Yamoah Grace Gyamfuah; Fabian Kiessling; Felix Gremse
Journal:  Mol Imaging Biol       Date:  2021-07-06       Impact factor: 3.488

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.