Literature DB >> 31634826

Orthogonal Deep Neural Networks.

Shuai Li, Kui Jia, Yuxin Wen, Tongliang Liu, Dacheng Tao.   

Abstract

In this paper, we introduce the algorithms of Orthogonal Deep Neural Networks (OrthDNNs) to connect with recent interest of spectrally regularized deep learning methods. OrthDNNs are theoretically motivated by generalization analysis of modern DNNs, with the aim to find solution properties of network weights that guarantee better generalization. To this end, we first prove that DNNs are of local isometry on data distributions of practical interest; by using a new covering of the sample space and introducing the local isometry property of DNNs into generalization analysis, we establish a new generalization error bound that is both scale- and range-sensitive to singular value spectrum of each of networks' weight matrices. We prove that the optimal bound w.r.t. the degree of isometry is attained when each weight matrix has a spectrum of equal singular values, among which orthogonal weight matrix or a non-square one with orthonormal rows or columns is the most straightforward choice, suggesting the algorithms of OrthDNNs. We present both algorithms of strict and approximate OrthDNNs, and for the later ones we propose a simple yet effective algorithm called Singular Value Bounding (SVB), which performs as well as strict OrthDNNs, but at a much lower computational cost. We also propose Bounded Batch Normalization (BBN) to make compatible use of batch normalization with OrthDNNs. We conduct extensive comparative studies by using modern architectures on benchmark image classification. Experiments show the efficacy of OrthDNNs.

Entities:  

Mesh:

Year:  2021        PMID: 31634826     DOI: 10.1109/TPAMI.2019.2948352

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  2 in total

1.  Supercharging Imbalanced Data Learning With Energy-based Contrastive Representation Transfer.

Authors:  Junya Chen; Zidi Xiu; Benjamin A Goldstein; Ricardo Henao; Lawrence Carin; Chenyang Tao
Journal:  Adv Neural Inf Process Syst       Date:  2021-12

2.  Unsupervised Event Graph Representation and Similarity Learning on Biomedical Literature.

Authors:  Giacomo Frisoni; Gianluca Moro; Giulio Carlassare; Antonella Carbonaro
Journal:  Sensors (Basel)       Date:  2021-12-21       Impact factor: 3.576

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.