Literature DB >> 29375240

Learning to Compose Domain-Specific Transformations for Data Augmentation.

Alexander J Ratner1, Henry R Ehrenberg1, Zeshan Hussain1, Jared Dunnmon1, Christopher Ré1.   

Abstract

Data augmentation is a ubiquitous technique for increasing the size of labeled training sets by leveraging task-specific data transformations that preserve class labels. While it is often easy for domain experts to specify individual transformations, constructing and tuning the more sophisticated compositions typically needed to achieve state-of-the-art results is a time-consuming manual task in practice. We propose a method for automating this process by learning a generative sequence model over user-specified transformation functions using a generative adversarial approach. Our method can make use of arbitrary, non-deterministic transformation functions, is robust to misspecified user input, and is trained on unlabeled data. The learned transformation model can then be used to perform data augmentation for any end discriminative model. In our experiments, we show the efficacy of our approach on both image and text datasets, achieving improvements of 4.0 accuracy points on CIFAR-10, 1.4 F1 points on the ACE relation extraction task, and 3.4 accuracy points when using domain-specific transformation operations on a medical imaging dataset as compared to standard heuristic augmentation approaches.

Entities:  

Year:  2017        PMID: 29375240      PMCID: PMC5786274     

Source DB:  PubMed          Journal:  Adv Neural Inf Process Syst        ISSN: 1049-5258


  4 in total

1.  Deep, big, simple neural nets for handwritten digit recognition.

Authors:  Dan Claudiu Cireşan; Ueli Meier; Luca Maria Gambardella; Jürgen Schmidhuber
Journal:  Neural Comput       Date:  2010-09-21       Impact factor: 2.026

2.  Discriminative Unsupervised Feature Learning with Exemplar Convolutional Neural Networks.

Authors:  Alexey Dosovitskiy; Philipp Fischer; Jost Tobias Springenberg; Martin Riedmiller; Thomas Brox
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2015-10-29       Impact factor: 6.226

3.  Enhancing text categorization with semantic-enriched representation and training data augmentation.

Authors:  Xinghua Lu; Bin Zheng; Atulya Velivelli; Chengxiang Zhai
Journal:  J Am Med Inform Assoc       Date:  2006-06-23       Impact factor: 4.497

4.  The Cancer Imaging Archive (TCIA): maintaining and operating a public information repository.

Authors:  Kenneth Clark; Bruce Vendt; Kirk Smith; John Freymann; Justin Kirby; Paul Koppel; Stephen Moore; Stanley Phillips; David Maffitt; Michael Pringle; Lawrence Tarbox; Fred Prior
Journal:  J Digit Imaging       Date:  2013-12       Impact factor: 4.056

  4 in total
  19 in total

1.  CAESNet: Convolutional AutoEncoder based Semi-supervised Network for improving multiclass classification of endomicroscopic images.

Authors:  Li Tong; Hang Wu; May D Wang
Journal:  J Am Med Inform Assoc       Date:  2019-11-01       Impact factor: 4.497

2.  A Kernel Theory of Modern Data Augmentation.

Authors:  Tri Dao; Albert Gu; Alexander J Ratner; Virginia Smith; Christopher De Sa; Christopher Ré
Journal:  Proc Mach Learn Res       Date:  2019-06

3.  Learning from dispersed manual annotations with an optimized data weighting policy.

Authors:  Yucheng Tang; Riqiang Gao; Yunqiang Chen; Dashan Gao; Michael R Savona; Richard G Abramson; Shunxing Bao; Yuankai Huo; Bennett A Landman
Journal:  J Med Imaging (Bellingham)       Date:  2020-07-30

4.  An automatic behavior recognition system classifies animal behaviors using movements and their temporal context.

Authors:  Primoz Ravbar; Kristin Branson; Julie H Simpson
Journal:  J Neurosci Methods       Date:  2019-08-12       Impact factor: 2.390

5.  Breast cancer detection using synthetic mammograms from generative adversarial networks in convolutional neural networks.

Authors:  Shuyue Guan; Murray Loew
Journal:  J Med Imaging (Bellingham)       Date:  2019-03-23

6.  Hidden Stratification Causes Clinically Meaningful Failures in Machine Learning for Medical Imaging.

Authors:  Luke Oakden-Rayner; Jared Dunnmon; Gustavo Carneiro; Christopher Ré
Journal:  Proc ACM Conf Health Inference Learn (2020)       Date:  2020-04

Review 7.  Artificial intelligence applications for pediatric oncology imaging.

Authors:  Heike Daldrup-Link
Journal:  Pediatr Radiol       Date:  2019-10-16

8.  Comparison of segmentation-free and segmentation-dependent computer-aided diagnosis of breast masses on a public mammography dataset.

Authors:  Rebecca Sawyer Lee; Jared A Dunnmon; Ann He; Siyi Tang; Christopher Ré; Daniel L Rubin
Journal:  J Biomed Inform       Date:  2020-12-11       Impact factor: 6.317

9.  Selective synthetic augmentation with HistoGAN for improved histopathology image classification.

Authors:  Yuan Xue; Jiarong Ye; Qianying Zhou; L Rodney Long; Sameer Antani; Zhiyun Xue; Carl Cornwell; Richard Zaino; Keith C Cheng; Xiaolei Huang
Journal:  Med Image Anal       Date:  2020-10-01       Impact factor: 8.545

10.  Training deep-learning segmentation models from severely limited data.

Authors:  Yao Zhao; Dong Joo Rhee; Carlos Cardenas; Laurence E Court; Jinzhong Yang
Journal:  Med Phys       Date:  2021-02-19       Impact factor: 4.071

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.