Literature DB >> 31871391

Slice-based Learning: A Programming Model for Residual Learning in Critical Data Slices.

Vincent S Chen1, Sen Wu1, Zhenzhen Weng1, Alexander Ratner1, Christopher Ré1.   

Abstract

In real-world machine learning applications, data subsets correspond to especially critical outcomes: vulnerable cyclist detections are safety-critical in an autonomous driving task, and "question" sentences might be important to a dialogue agent's language understanding for product purposes. While machine learning models can achieve high quality performance on coarse-grained metrics like F1-score and overall accuracy, they may underperform on critical subsets-we define these as slices, the key abstraction in our approach. To address slice-level performance, practitioners often train separate "expert" models on slice subsets or use multi-task hard parameter sharing. We propose Slice-based Learning, a new programming model in which the slicing function (SF), a programming interface, specifies critical data subsets for which the model should commit additional capacity. Any model can leverage SFs to learn slice expert representations, which are combined with an attention mechanism to make slice-aware predictions. We show that our approach maintains a parameter-efficient representation while improving over baselines by up to 19.0 F1 on slices and 4.6 F1 overall on datasets spanning language understanding (e.g. SuperGLUE), computer vision, and production-scale industrial systems.

Entities:  

Year:  2019        PMID: 31871391      PMCID: PMC6927210     

Source DB:  PubMed          Journal:  Adv Neural Inf Process Syst        ISSN: 1049-5258


  5 in total

1.  Comparison of the predicted and observed secondary structure of T4 phage lysozyme.

Authors:  B W Matthews
Journal:  Biochim Biophys Acta       Date:  1975-10-20

2.  Adaptive Mixtures of Local Experts.

Authors:  Robert A Jacobs; Michael I Jordan; Steven J Nowlan; Geoffrey E Hinton
Journal:  Neural Comput       Date:  1991       Impact factor: 2.026

3.  Snorkel DryBell: A Case Study in Deploying Weak Supervision at Industrial Scale.

Authors:  Stephen H Bach; Daniel Rodriguez; Yintao Liu; Chong Luo; Haidong Shao; Cassandra Xia; Souvik Sen; Alex Ratner; Braden Hancock; Houman Alborzi; Rahul Kuchhal; Chris Ré; Rob Malkin
Journal:  Proc ACM SIGMOD Int Conf Manag Data       Date:  2019 Jun-Jul

4.  Data Programming: Creating Large Training Sets, Quickly.

Authors:  Alexander Ratner; Christopher De Sa; Sen Wu; Daniel Selsam; Christopher Ré
Journal:  Adv Neural Inf Process Syst       Date:  2016-12

5.  Snorkel: Rapid Training Data Creation with Weak Supervision.

Authors:  Alexander Ratner; Stephen H Bach; Henry Ehrenberg; Jason Fries; Sen Wu; Christopher Ré
Journal:  Proceedings VLDB Endowment       Date:  2017-11
  5 in total
  1 in total

1.  Hidden Stratification Causes Clinically Meaningful Failures in Machine Learning for Medical Imaging.

Authors:  Luke Oakden-Rayner; Jared Dunnmon; Gustavo Carneiro; Christopher Ré
Journal:  Proc ACM Conf Health Inference Learn (2020)       Date:  2020-04
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.