Literature DB >> 32589587

Block-term tensor neural networks.

Jinmian Ye1, Guangxi Li2, Di Chen1, Haiqin Yang3, Shandian Zhe4, Zenglin Xu5.   

Abstract

Deep neural networks (DNNs) have achieved outstanding performance in a wide range of applications, e.g., image classification, natural language processing, etc. Despite the good performance, the huge number of parameters in DNNs brings challenges to efficient training of DNNs and also their deployment in low-end devices with limited computing resources. In this paper, we explore the correlations in the weight matrices, and approximate the weight matrices with the low-rank block-term tensors. We name the new corresponding structure as block-term tensor layers (BT-layers), which can be easily adapted to neural network models, such as CNNs and RNNs. In particular, the inputs and the outputs in BT-layers are reshaped into low-dimensional high-order tensors with a similar or improved representation power. Sufficient experiments have demonstrated that BT-layers in CNNs and RNNs can achieve a very large compression ratio on the number of parameters while preserving or improving the representation power of the original DNNs.
Copyright © 2020 Elsevier Ltd. All rights reserved.

Keywords:  Deep learning; Network compression; Neural networks; Tensor networks

Year:  2020        PMID: 32589587     DOI: 10.1016/j.neunet.2020.05.034

Source DB:  PubMed          Journal:  Neural Netw        ISSN: 0893-6080


  1 in total

1.  Compact Neural Architecture Designs by Tensor Representations.

Authors:  Jiahao Su; Jingling Li; Xiaoyu Liu; Teresa Ranadive; Christopher Coley; Tai-Ching Tuan; Furong Huang
Journal:  Front Artif Intell       Date:  2022-03-08
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.