Literature DB >> 35073273

Automatic Sparse Connectivity Learning for Neural Networks.

Zhimin Tang, Linkai Luo, Bike Xie, Yiyu Zhu, Rujie Zhao, Lvqing Bi, Chao Lu.   

Abstract

Since sparse neural networks usually contain many zero weights, these unnecessary network connections can potentially be eliminated without degrading network performance. Therefore, well-designed sparse neural networks have the potential to significantly reduce the number of floating-point operations (FLOPs) and computational resources. In this work, we propose a new automatic pruning method--sparse connectivity learning (SCL). Specifically, a weight is reparameterized as an elementwise multiplication of a trainable weight variable and a binary mask. Thus, network connectivity is fully described by the binary mask, which is modulated by a unit step function. We theoretically prove the fundamental principle of using a straight-through estimator (STE) for network pruning. This principle is that the proxy gradients of STE should be positive, ensuring that mask variables converge at their minima. After finding Leaky ReLU, Softplus, and identity STEs can satisfy this principle, we propose to adopt identity STE in SCL for discrete mask relaxation. We find that mask gradients of different features are very unbalanced; hence, we propose to normalize mask gradients of each feature to optimize mask variable training. In order to automatically train sparse masks, we include the total number of network connections as a regularization term in our objective function. As SCL does not require pruning criteria or hyperparameters defined by designers for network layers, the network is explored in a larger hypothesis space to achieve optimized sparse connectivity for the best performance. SCL overcomes the limitations of existing automatic pruning methods. Experimental results demonstrate that SCL can automatically learn and select important network connections for various baseline network structures. Deep learning models trained by SCL outperform the state-of-the-art human-designed and automatic pruning methods in sparsity, accuracy, and FLOPs reduction.

Entities:  

Year:  2022        PMID: 35073273     DOI: 10.1109/TNNLS.2022.3141665

Source DB:  PubMed          Journal:  IEEE Trans Neural Netw Learn Syst        ISSN: 2162-237X            Impact factor:   10.451


  4 in total

1.  A Novel Deep Learning-Based Cooperative Communication Channel Model for Wireless Underground Sensor Networks.

Authors:  Kanthavel Radhakrishnan; Dhaya Ramakrishnan; Osamah Ibrahim Khalaf; Mueen Uddin; Chin-Ling Chen; Chih-Ming Wu
Journal:  Sensors (Basel)       Date:  2022-06-13       Impact factor: 3.847

2.  A One-Stage Ensemble Framework Based on Convolutional Autoencoder for Remaining Useful Life Estimation.

Authors:  Yong-Keun Park; Min-Kyung Kim; Jumyung Um
Journal:  Sensors (Basel)       Date:  2022-04-06       Impact factor: 3.576

3.  Predicting Activity Duration in Smart Sensing Environments Using Synthetic Data and Partial Least Squares Regression: The Case of Dementia Patients.

Authors:  Miguel Ortiz-Barrios; Eric Järpe; Matías García-Constantino; Ian Cleland; Chris Nugent; Sebastián Arias-Fonseca; Natalia Jaramillo-Rueda
Journal:  Sensors (Basel)       Date:  2022-07-20       Impact factor: 3.847

4.  Study on the Construction of a Time-Space Four-Dimensional Combined Imaging Model and Moving Target Location Prediction Model.

Authors:  Junchao Zhu; Qi Zeng; Fangfang Han; Huifeng Cao; Yongxin Bian; Chenhong Wei
Journal:  Sensors (Basel)       Date:  2022-08-24       Impact factor: 3.847

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.