Slice-based Learning: A Programming Model for Residual Learning in Critical Data Slices.

Vincent S Chen¹, Sen Wu¹, Zhenzhen Weng¹, Alexander Ratner¹, Christopher Ré¹.
1. Stanford University.

Abstract

In real-world machine learning applications, data subsets correspond to especially critical outcomes: vulnerable cyclist detections are safety-critical in an autonomous driving task, and "question" sentences might be important to a dialogue agent's language understanding for product purposes. While machine learning models can achieve high-quality performance on coarse-grained metrics like F1-score and overall accuracy, they may underperform on critical subsets. We define these subsets as slices, the key abstraction in our approach. To address slice-level performance, practitioners often train separate "expert" models on slice subsets or use multi-task hard parameter sharing. We propose Slice-based Learning, a new programming model in which the slicing function (SF), a programming interface, specifies critical data subsets for which the model should commit additional capacity. Any model can leverage SFs to learn slice expert representations, which are combined with an attention mechanism to make slice-aware predictions. We show that our approach maintains a parameter-efficient representation while improving over baselines by up to 19.0 F1 on slices and 4.6 F1 overall on datasets spanning language understanding (e.g., SuperGLUE), computer vision, and production-scale industrial systems.
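
The two ideas in the abstract, slicing functions that flag critical subsets and an attention step that combines per-slice expert representations, can be illustrated with a minimal sketch. This is not the authors' implementation. The function names (`sf_question`, `sf_short`, `apply_sfs`, `combine_experts`) are hypothetical, and the attention weights here are a plain softmax over assumed membership scores, which is a simplification made for illustration.

```python
import math
from typing import Callable, List

# A slicing function (SF) maps one example to True/False: is it in the slice?
SlicingFunction = Callable[[str], bool]


def sf_question(sentence: str) -> bool:
    """Hypothetical SF: flags sentences that look like questions."""
    return sentence.strip().endswith("?")


def sf_short(sentence: str) -> bool:
    """Hypothetical SF: flags sentences with at most three tokens."""
    return len(sentence.split()) <= 3


def apply_sfs(sfs: List[SlicingFunction], sentences: List[str]) -> List[List[int]]:
    """Return one row per sentence and one 0/1 membership column per SF."""
    return [[int(sf(s)) for sf in sfs] for s in sentences]


def softmax(scores: List[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def combine_experts(expert_reprs: List[List[float]],
                    membership_scores: List[float]) -> List[float]:
    """Attention-weighted sum of per-slice representation vectors.

    Assumption: weights come from a softmax over membership scores only.
    """
    weights = softmax(membership_scores)
    dim = len(expert_reprs[0])
    return [sum(w * r[d] for w, r in zip(weights, expert_reprs))
            for d in range(dim)]


if __name__ == "__main__":
    sentences = ["Where is it?", "The cyclist crossed the road."]
    print(apply_sfs([sf_question, sf_short], sentences))
    print(combine_experts([[1.0, 0.0], [0.0, 1.0]], [2.0, 0.0]))
```

In this sketch, a higher membership score shifts the combined representation toward that slice's expert vector, while equal scores yield a simple average.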

Year: 2019 PMID: 31871391 PMCID: PMC6927210

Source DB: PubMed Journal: Adv Neural Inf Process Syst ISSN: 1049-5258
