A Hybrid Human-Computer Approach to the Extraction of Scientific Facts from the Literature.

Roselyne B Tchoua¹, Kyle Chard², Debra Audus³, Jian Qin⁴, Juan de Pablo⁵, Ian Foster¹,²,⁶.

Abstract

A wealth of valuable data is locked within the millions of research articles published each year. Reading and extracting pertinent information from those articles has become an unmanageable task for scientists. This problem hinders scientific progress by making it hard to build on results buried in the literature. Moreover, these data are loosely structured, encoded in manuscripts of various formats, embedded in different content types, and, in general, not machine accessible. We present a hybrid human-computer solution for semi-automatically extracting scientific facts from the literature. This solution combines an automated discovery, download, and extraction phase with a semi-expert crowd assembled from students to extract specific scientific facts. To evaluate our approach, we apply it to a challenging molecular engineering scenario: extraction of a polymer property, the Flory-Huggins interaction parameter. We demonstrate useful contributions to a comprehensive database of polymer properties.

Keywords: Classification; Crowdsourcing; Flory-Huggins; Information Extraction; Materials Science

Year: 2016 PMID: 28649288 PMCID: PMC5482373 DOI: 10.1016/j.procs.2016.05.338

Source DB: PubMed Journal: Procedia Comput Sci

2 in total

1. Polymer Informatics: Opportunities and Challenges.

Authors: Debra J Audus; Juan J de Pablo
Journal: ACS Macro Lett Date: 2017-09-15 Impact factor: 6.903

2. Blending Education and Polymer Science: Semi Automated Creation of a Thermodynamic Property Database.

Authors: Roselyne B Tchoua; Jian Qin; Debra J Audus; Kyle Chard; Ian T Foster; Juan de Pablo
Journal: J Chem Educ Date: 2016-08-15 Impact factor: 2.979
