Literature DB >> 33830929

General Bitwidth Assignment for Efficient Deep Convolutional Neural Network Quantization.

Wen Fei, Wenrui Dai, Chenglin Li, Junni Zou, Hongkai Xiong.   

Abstract

Model quantization is essential to deploy deep convolutional neural networks (DCNNs) on resource-constrained devices. In this article, we propose a general bitwidth assignment algorithm based on theoretical analysis for efficient layerwise weight and activation quantization of DCNNs. The proposed algorithm develops a prediction model to explicitly estimate the loss of classification accuracy led by weight quantization with a geometrical approach. Consequently, dynamic programming is adopted to achieve optimal bitwidth assignment on weights based on the estimated error. Furthermore, we optimize bitwidth assignment for activations by considering the signal-to-quantization-noise ratio (SQNR) between weight and activation quantization. The proposed algorithm is general to reveal the tradeoff between classification accuracy and model size for various network architectures. Extensive experiments demonstrate the efficacy of the proposed bitwidth assignment algorithm and the error rate prediction model. Furthermore, the proposed algorithm is shown to be well extended to object detection.

Entities:  

Mesh:

Year:  2022        PMID: 33830929     DOI: 10.1109/TNNLS.2021.3069886

Source DB:  PubMed          Journal:  IEEE Trans Neural Netw Learn Syst        ISSN: 2162-237X            Impact factor:   14.255


  1 in total

1.  Visual Dissemination of Intangible Cultural Heritage Information Based on 3D Scanning and Virtual Reality Technology.

Authors:  Wulong Xu; Xijie Sun; Shihui Pan
Journal:  Scanning       Date:  2022-09-25       Impact factor: 1.750

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.