Literature DB >> 33476265

Regularizing Deep Networks With Semantic Data Augmentation.

Yulin Wang, Gao Huang, Shiji Song, Xuran Pan, Yitong Xia, Cheng Wu.   

Abstract

Data augmentation is widely known as a simple yet surprisingly effective technique for regularizing deep networks. Conventional data augmentation schemes, e.g., flipping, translation or rotation, are low-level, data-independent and class-agnostic operations, leading to limited diversity for augmented samples. To this end, we propose a novel semantic data augmentation algorithm to complement traditional approaches. The proposed method is inspired by the intriguing property that deep networks are effective in learning linearized features, i.e., certain directions in the deep feature space correspond to meaningful semantic transformations, e.g., changing the background or view angle of an object. Based on this observation, translating training samples along many such directions in the feature space can effectively augment the dataset for more diversity. To implement this idea, we first introduce a sampling based method to obtain semantically meaningful directions efficiently. Then, an upper bound of the expected cross-entropy (CE) loss on the augmented training set is derived by assuming the number of augmented samples goes to infinity, yielding a highly efficient algorithm. In fact, we show that the proposed implicit semantic data augmentation (ISDA) algorithm amounts to minimizing a novel robust CE loss, which adds minimal extra computational cost to a normal training procedure. In addition to supervised learning, ISDA can be applied to semi-supervised learning tasks under the consistency regularization framework, where ISDA amounts to minimizing the upper bound of the expected KL-divergence between the augmented features and the original features. Although being simple, ISDA consistently improves the generalization performance of popular deep models (e.g., ResNets and DenseNets) on a variety of datasets, i.e., CIFAR-10, CIFAR-100, SVHN, ImageNet, and Cityscapes. Code for reproducing our results is available at https://github.com/blackfeather-wang/ISDA-for-Deep-Networks.

Entities:  

Year:  2022        PMID: 33476265     DOI: 10.1109/TPAMI.2021.3052951

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  4 in total

1.  AntiDMPpred: a web service for identifying anti-diabetic peptides.

Authors:  Xue Chen; Jian Huang; Bifang He
Journal:  PeerJ       Date:  2022-06-14       Impact factor: 3.061

2.  A Novel Quick-Response Eigenface Analysis Scheme for Brain-Computer Interfaces.

Authors:  Hojong Choi; Junghun Park; Yeon-Mo Yang
Journal:  Sensors (Basel)       Date:  2022-08-05       Impact factor: 3.847

Review 3.  A Review of Performance Prediction Based on Machine Learning in Materials Science.

Authors:  Ziyang Fu; Weiyi Liu; Chen Huang; Tao Mei
Journal:  Nanomaterials (Basel)       Date:  2022-08-26       Impact factor: 5.719

4.  ACP-DA: Improving the Prediction of Anticancer Peptides Using Data Augmentation.

Authors:  Xian-Gan Chen; Wen Zhang; Xiaofei Yang; Chenhong Li; Hengling Chen
Journal:  Front Genet       Date:  2021-06-30       Impact factor: 4.599

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.