Literature DB >> 28344371

DeepDive: Declarative Knowledge Base Construction.

Christopher De Sa1, Alex Ratner1, Christopher Ré1, Jaeho Shin1, Feiran Wang1, Sen Wu1, Ce Zhang1.   

Abstract

The dark data extraction or knowledge base construction (KBC) problem is to populate a SQL database with information from unstructured data sources including emails, webpages, and pdf reports. KBC is a long-standing problem in industry and research that encompasses problems of data extraction, cleaning, and integration. We describe DeepDive, a system that combines database and machine learning ideas to help develop KBC systems. The key idea in DeepDive is that statistical inference and machine learning are key tools to attack classical data problems in extraction, cleaning, and integration in a unified and more effective manner. DeepDive programs are declarative in that one cannot write probabilistic inference algorithms; instead, one interacts by defining features or rules about the domain. A key reason for this design choice is to enable domain experts to build their own KBC systems. We present the applications, abstractions, and techniques of DeepDive employed to accelerate construction of KBC systems.

Entities:  

Year:  2016        PMID: 28344371      PMCID: PMC5361060     

Source DB:  PubMed          Journal:  SIGMOD Rec        ISSN: 0163-5808            Impact factor:   0.775


  3 in total

1.  A machine reading system for assembling synthetic paleontological databases.

Authors:  Shanan E Peters; Ce Zhang; Miron Livny; Christopher Ré
Journal:  PLoS One       Date:  2014-12-01       Impact factor: 3.240

2.  Incremental Knowledge Base Construction Using DeepDive.

Authors:  Jaeho Shin; Sen Wu; Feiran Wang; Christopher De Sa; Ce Zhang; Christopher Ré
Journal:  Proceedings VLDB Endowment       Date:  2015-07

3.  Large-scale extraction of gene interactions from full-text literature using DeepDive.

Authors:  Emily K Mallory; Ce Zhang; Christopher Ré; Russ B Altman
Journal:  Bioinformatics       Date:  2015-09-03       Impact factor: 6.937

  3 in total
  3 in total

1.  Gait biomechanics in the era of data science.

Authors:  Reed Ferber; Sean T Osis; Jennifer L Hicks; Scott L Delp
Journal:  J Biomech       Date:  2016-10-27       Impact factor: 2.712

2.  ICARUS: Minimizing Human Effort in Iterative Data Completion.

Authors:  Protiva Rahman; Courtney Hebert; Arnab Nandi
Journal:  Proceedings VLDB Endowment       Date:  2018-09

3.  Declarative Learning-Based Programming as an Interface to AI Systems.

Authors:  Parisa Kordjamshidi; Dan Roth; Kristian Kersting
Journal:  Front Artif Intell       Date:  2022-03-14
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.