Mattias P Heinrich1, Max Blendowski2, Ozan Oktay3. 1. Institute of Medical Informatics, University of Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany. heinrich@imi.uni-luebeck.de. 2. Institute of Medical Informatics, University of Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany. 3. Biomedical Image Analysis Group, Department of Computing, Imperial College London, London, SW7 2AZ, UK.
Abstract
PURPOSE: Deep convolutional neural networks (DCNN) are currently ubiquitous in medical imaging. While their versatility and high-quality results for common image analysis tasks including segmentation, localisation and prediction is astonishing, the large representational power comes at the cost of highly demanding computational effort. This limits their practical applications for image-guided interventions and diagnostic (point-of-care) support using mobile devices without graphics processing units (GPU). METHODS: We propose a new scheme that approximates both trainable weights and neural activations in deep networks by ternary values and tackles the open question of backpropagation when dealing with non-differentiable functions. Our solution enables the removal of the expensive floating-point matrix multiplications throughout any convolutional neural network and replaces them by energy- and time-preserving binary operators and population counts. RESULTS: We evaluate our approach for the segmentation of the pancreas in CT. Here, our ternary approximation within a fully convolutional network leads to more than 90% memory reductions and high accuracy (without any post-processing) with a Dice overlap of 71.0% that comes close to the one obtained when using networks with high-precision weights and activations. We further provide a concept for sub-second inference without GPUs and demonstrate significant improvements in comparison with binary quantisation and without our proposed ternary hyperbolic tangent continuation. CONCLUSIONS: We present a key enabling technique for highly efficient DCNN inference without GPUs that will help to bring the advances of deep learning to practical clinical applications. It has also great promise for improving accuracies in large-scale medical data retrieval.
PURPOSE: Deep convolutional neural networks (DCNN) are currently ubiquitous in medical imaging. While their versatility and high-quality results for common image analysis tasks including segmentation, localisation and prediction is astonishing, the large representational power comes at the cost of highly demanding computational effort. This limits their practical applications for image-guided interventions and diagnostic (point-of-care) support using mobile devices without graphics processing units (GPU). METHODS: We propose a new scheme that approximates both trainable weights and neural activations in deep networks by ternary values and tackles the open question of backpropagation when dealing with non-differentiable functions. Our solution enables the removal of the expensive floating-point matrix multiplications throughout any convolutional neural network and replaces them by energy- and time-preserving binary operators and population counts. RESULTS: We evaluate our approach for the segmentation of the pancreas in CT. Here, our ternary approximation within a fully convolutional network leads to more than 90% memory reductions and high accuracy (without any post-processing) with a Dice overlap of 71.0% that comes close to the one obtained when using networks with high-precision weights and activations. We further provide a concept for sub-second inference without GPUs and demonstrate significant improvements in comparison with binary quantisation and without our proposed ternary hyperbolic tangent continuation. CONCLUSIONS: We present a key enabling technique for highly efficient DCNN inference without GPUs that will help to bring the advances of deep learning to practical clinical applications. It has also great promise for improving accuracies in large-scale medical data retrieval.
Keywords:
Deep learning; Hamming distance; Model compression; Pancreas; Segmentation; Sparsity
Authors: Michael Calonder; Vincent Lepetit; Mustafa Özuysal; Tomasz Trzcinski; Christoph Strecha; Pascal Fua Journal: IEEE Trans Pattern Anal Mach Intell Date: 2011-11-15 Impact factor: 6.226
Authors: Zhoubing Xu; Ryan P Burke; Christopher P Lee; Rebeccah B Baucom; Benjamin K Poulose; Richard G Abramson; Bennett A Landman Journal: Med Image Anal Date: 2015-05-21 Impact factor: 8.545
Authors: Mattias P Heinrich; Mark Jenkinson; Bartlomiej W Papiez; Fergus V Glesson; Sir Michael Brady; Julia A Schnabel Journal: Inf Process Med Imaging Date: 2013
Authors: Zhoubing Xu; Sahil A Panjwani; Christopher P Lee; Ryan P Burke; Rebeccah B Baucom; Benjamin K Poulose; Richard G Abramson; Bennett A Landman Journal: Proc SPIE Int Soc Opt Eng Date: 2016-03-21
Authors: Zhoubing Xu; Christopher P Lee; Mattias P Heinrich; Marc Modat; Daniel Rueckert; Sebastien Ourselin; Richard G Abramson; Bennett A Landman Journal: IEEE Trans Biomed Eng Date: 2016-06-01 Impact factor: 4.538
Authors: Jo Schlemper; Ozan Oktay; Michiel Schaap; Mattias Heinrich; Bernhard Kainz; Ben Glocker; Daniel Rueckert Journal: Med Image Anal Date: 2019-02-05 Impact factor: 8.545