| Literature DB >> 33652370 |
Jin-Woo Seo1, Hong-Gyu Jung1, Seong-Whan Lee2.
Abstract
Few-shot learning aims to classify unseen classes with a few training examples. While recent works have shown that standard mini-batch training with carefully designed training strategies can improve generalization ability for unseen classes, well-known problems in deep networks such as memorizing training statistics have been less explored for few-shot learning. To tackle this issue, we propose self-augmentation that consolidates self-mix and self-distillation. Specifically, we propose a regional dropout technique called self-mix, in which a patch of an image is substituted into other values in the same image. With this dropout effect, we show that the generalization ability of deep networks can be improved as it prevents us from learning specific structures of a dataset. Then, we employ a backbone network that has auxiliary branches with its own classifier to enforce knowledge sharing. This sharing of knowledge forces each branch to learn diverse optimal points during training. Additionally, we present a local representation learner to further exploit a few training examples of unseen classes by generating fake queries and novel weights. Experimental results show that the proposed method outperforms the state-of-the-art methods for prevalent few-shot benchmarks and improves the generalization ability.Keywords: Classification; Few-shot learning; Generalization; Knowledge distillation
Mesh:
Year: 2021 PMID: 33652370 DOI: 10.1016/j.neunet.2021.02.007
Source DB: PubMed Journal: Neural Netw ISSN: 0893-6080