Literature DB >> 29391769

Inferring Generative Model Structure with Static Analysis.

Paroma Varma1, Bryan He1, Payal Bajaj1, Imon Banerjee1, Nishith Khandwala1, Daniel L Rubin1, Christopher Ré1.   

Abstract

Obtaining enough labeled data to robustly train complex discriminative models is a major bottleneck in the machine learning pipeline. A popular solution is combining multiple sources of weak supervision using generative models. The structure of these models affects training label quality, but is difficult to learn without any ground truth labels. We instead rely on these weak supervision sources having some structure by virtue of being encoded programmatically. We present Coral, a paradigm that infers generative model structure by statically analyzing the code for these heuristics, thus reducing the data required to learn structure significantly. We prove that Coral's sample complexity scales quasilinearly with the number of heuristics and number of relations found, improving over the standard sample complexity, which is exponential in n for identifying nth degree relations. Experimentally, Coral matches or outperforms traditional structure learning approaches by up to 3.81 F1 points. Using Coral to model dependencies instead of assuming independence results in better performance than a fully supervised model by 3.07 accuracy points when heuristics are used to label radiology data without ground truth labels.

Entities:  

Year:  2017        PMID: 29391769      PMCID: PMC5789796     

Source DB:  PubMed          Journal:  Adv Neural Inf Process Syst        ISSN: 1049-5258


  8 in total

1.  Constructing biological knowledge bases by extracting information from text sources.

Authors:  M Craven; J Kumlien
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  1999

2.  Training products of experts by minimizing contrastive divergence.

Authors:  Geoffrey E Hinton
Journal:  Neural Comput       Date:  2002-08       Impact factor: 2.026

3.  Knowledge representation and sharing using visual semantic modeling for diagnostic medical image databases.

Authors:  Adrian S Barb; Chi-Ren Shyu; Yash P Sethi
Journal:  IEEE Trans Inf Technol Biomed       Date:  2005-12

Review 4.  A review of automatic mass detection and segmentation in mammographic images.

Authors:  Arnau Oliver; Jordi Freixenet; Joan Martí; Elsa Pérez; Josep Pont; Erika R E Denton; Reyer Zwiggelaar
Journal:  Med Image Anal       Date:  2009-12-29       Impact factor: 8.545

5.  Automated segmentation of MR images of brain tumors.

Authors:  M R Kaus; S K Warfield; A Nabavi; P M Black; F A Jolesz; R Kikinis
Journal:  Radiology       Date:  2001-02       Impact factor: 11.105

6.  Automated medical image segmentation techniques.

Authors:  Neeraj Sharma; Lalit M Aggarwal
Journal:  J Med Phys       Date:  2010-01

7.  On combining image-based and ontological semantic dissimilarities for medical image retrieval applications.

Authors:  Camille Kurtz; Adrien Depeursinge; Sandy Napel; Christopher F Beaulieu; Daniel L Rubin
Journal:  Med Image Anal       Date:  2014-07-02       Impact factor: 8.545

8.  Incremental Knowledge Base Construction Using DeepDive.

Authors:  Jaeho Shin; Sen Wu; Feiran Wang; Christopher De Sa; Ce Zhang; Christopher Ré
Journal:  Proceedings VLDB Endowment       Date:  2015-07
  8 in total
  3 in total

1.  Training Complex Models with Multi-Task Weak Supervision.

Authors:  Alexander Ratner; Braden Hancock; Jared Dunnmon; Frederic Sala; Shreyash Pandey; Christopher Ré
Journal:  Proc Conf AAAI Artif Intell       Date:  2019 Jan-Feb

2.  Snuba: Automating Weak Supervision to Label Training Data.

Authors:  Paroma Varma; Christopher Ré
Journal:  Proceedings VLDB Endowment       Date:  2018-11

3.  Weakly supervised classification of aortic valve malformations using unlabeled cardiac MRI sequences.

Authors:  Jason A Fries; Paroma Varma; Vincent S Chen; Ke Xiao; Heliodoro Tejeda; Priyanka Saha; Jared Dunnmon; Henry Chubb; Shiraz Maskatia; Madalina Fiterau; Scott Delp; Euan Ashley; Christopher Ré; James R Priest
Journal:  Nat Commun       Date:  2019-07-15       Impact factor: 14.919

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.