Literature DB >> 32340937

Norm-Preservation: Why Residual Networks Can Become Extremely Deep?

Alireza Zaeemzadeh, Nazanin Rahnavard, Mubarak Shah.   

Abstract

Augmenting neural networks with skip connections, as introduced in the so-called ResNet architecture, surprised the community by enabling the training of networks of more than 1,000 layers with significant performance gains. This paper deciphers ResNet by analyzing the effect of skip connections, and puts forward new theoretical results on the advantages of identity skip connections in neural networks. We prove that the skip connections in the residual blocks facilitate preserving the norm of the gradient, and lead to stable back-propagation, which is desirable from optimization perspective. We also show that, perhaps surprisingly, as more residual blocks are stacked, the norm-preservation of the network is enhanced. Our theoretical arguments are supported by extensive empirical evidence. Can we push for extra norm-preservation? We answer this question by proposing an efficient method to regularize the singular values of the convolution operator and making the ResNet's transition layers extra norm-preserving. Our numerical investigations demonstrate that the learning dynamics and the classification performance of ResNet can be improved by making it even more norm preserving. Our results and the introduced modification for ResNet, referred to as Procrustes ResNets, can be used as a guide for training deeper networks and can also inspire new deeper architectures.

Year:  2020        PMID: 32340937     DOI: 10.1109/TPAMI.2020.2990339

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  3 in total

1.  Fully automatic deep learning trained on limited data for carotid artery segmentation from large image volumes.

Authors:  Tianshu Zhou; Tao Tan; Xiaoyan Pan; Hui Tang; Jingsong Li
Journal:  Quant Imaging Med Surg       Date:  2021-01

Review 2.  Prospective Deployment of Deep Learning in MRI: A Framework for Important Considerations, Challenges, and Recommendations for Best Practices.

Authors:  Akshay S Chaudhari; Christopher M Sandino; Elizabeth K Cole; David B Larson; Garry E Gold; Shreyas S Vasanawala; Matthew P Lungren; Brian A Hargreaves; Curtis P Langlotz
Journal:  J Magn Reson Imaging       Date:  2020-08-24       Impact factor: 5.119

3.  Classification of moving coronary calcified plaques based on motion artifacts using convolutional neural networks: a robotic simulating study on influential factors.

Authors:  Magdalena Dobrolińska; Niels van der Werf; Marcel Greuter; Beibei Jiang; Riemer Slart; Xueqian Xie
Journal:  BMC Med Imaging       Date:  2021-10-19       Impact factor: 1.930

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.