Literature DB >> 25887686

Wide-coverage relation extraction from MEDLINE using deep syntax.

Nhung T H Nguyen1, Makoto Miwa2, Yoshimasa Tsuruoka3, Takashi Chikayama4, Satoshi Tojo5.   

Abstract

BACKGROUND: Relation extraction is a fundamental technology in biomedical text mining. Most of the previous studies on relation extraction from biomedical literature have focused on specific or predefined types of relations, which inherently limits the types of the extracted relations. With the aim of fully leveraging the knowledge described in the literature, we address much broader types of semantic relations using a single extraction framework.
RESULTS: Our system, which we name PASMED, extracts diverse types of binary relations from biomedical literature using deep syntactic patterns. Our experimental results demonstrate that it achieves a level of recall considerably higher than the state of the art, while maintaining reasonable precision. We have then applied PASMED to the whole MEDLINE corpus and extracted more than 137 million semantic relations. The extracted relations provide a quantitative understanding of what kinds of semantic relations are actually described in MEDLINE and can be ultimately extracted by (possibly type-specific) relation extraction systems.
CONCLUSION: PASMED extracts a large number of relations that have previously been missed by existing text mining systems. The entire collection of the relations extracted from MEDLINE is publicly available in machine-readable form, so that it can serve as a potential knowledge base for high-level text-mining applications.

Entities:  

Mesh:

Year:  2015        PMID: 25887686      PMCID: PMC4396593          DOI: 10.1186/s12859-015-0538-8

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  21 in total

1.  The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text.

Authors:  Thomas C Rindflesch; Marcelo Fiszman
Journal:  J Biomed Inform       Date:  2003-12       Impact factor: 6.317

2.  The Unified Medical Language System (UMLS): integrating biomedical terminology.

Authors:  Olivier Bodenreider
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

3.  Extracting human protein interactions from MEDLINE using a full-sentence parser.

Authors:  Nikolai Daraselia; Anton Yuryev; Sergei Egorov; Svetalana Novichkova; Alexander Nikitin; Ilya Mazo
Journal:  Bioinformatics       Date:  2004-01-22       Impact factor: 6.937

4.  An overview of MetaMap: historical perspective and recent advances.

Authors:  Alan R Aronson; François-Michel Lang
Journal:  J Am Med Inform Assoc       Date:  2010 May-Jun       Impact factor: 4.497

5.  RelEx--relation extraction using dependency parse trees.

Authors:  Katrin Fundel; Robert Küffner; Ralf Zimmer
Journal:  Bioinformatics       Date:  2006-12-01       Impact factor: 6.937

6.  Extraction of gene-disease relations from Medline using domain dictionaries and machine learning.

Authors:  Hong-Woo Chun; Yoshimasa Tsuruoka; Jin-Dong Kim; Rie Shiba; Naoki Nagata; Teruyoshi Hishiki; Jun'ichi Tsujii
Journal:  Pac Symp Biocomput       Date:  2006

7.  Large-scale event extraction from literature with multi-level gene normalization.

Authors:  Sofie Van Landeghem; Jari Björne; Chih-Hsuan Wei; Kai Hakala; Sampo Pyysalo; Sophia Ananiadou; Hung-Yu Kao; Zhiyong Lu; Tapio Salakoski; Yves Van de Peer; Filip Ginter
Journal:  PLoS One       Date:  2013-04-17       Impact factor: 3.240

Review 8.  A critical review of PASBio's argument structures for biomedical verbs.

Authors:  K Bretonnel Cohen; Lawrence Hunter
Journal:  BMC Bioinformatics       Date:  2006-11-24       Impact factor: 3.169

9.  PASBio: predicate-argument structures for event extraction in molecular biology.

Authors:  Tuangthong Wattarujeekrit; Parantu K Shah; Nigel Collier
Journal:  BMC Bioinformatics       Date:  2004-10-19       Impact factor: 3.169

10.  Large-scale extraction of accurate drug-disease treatment pairs from biomedical literature for drug repurposing.

Authors:  Rong Xu; Quanqiu Wang
Journal:  BMC Bioinformatics       Date:  2013-06-06       Impact factor: 3.169

View more
  6 in total

1.  Learning Inter-Sentence, Disorder-Centric, Biomedical Relationships from Medical Literature.

Authors:  Anton H van der Vegt; Guido Zuccon; Bevan Koopman
Journal:  AMIA Annu Symp Proc       Date:  2020-03-04

2.  Bio-SimVerb and Bio-SimLex: wide-coverage evaluation sets of word similarity in biomedicine.

Authors:  Billy Chiu; Sampo Pyysalo; Ivan Vulić; Anna Korhonen
Journal:  BMC Bioinformatics       Date:  2018-02-05       Impact factor: 3.169

3.  COPIOUS: A gold standard corpus of named entities towards extracting species occurrence from biodiversity literature.

Authors:  Nhung T H Nguyen; Roselyn S Gabud; Sophia Ananiadou
Journal:  Biodivers Data J       Date:  2019-01-22

4.  Using a Large Margin Context-Aware Convolutional Neural Network to Automatically Extract Disease-Disease Association from Literature: Comparative Analytic Study.

Authors:  Richard Tzong-Han Tsai; Jorng-Tzong Horng; Po-Ting Lai; Wei-Liang Lu; Ting-Rung Kuo; Chia-Ru Chung; Jen-Chieh Han
Journal:  JMIR Med Inform       Date:  2019-11-26

5.  Support Vector Machine with Ensemble Tree Kernel for Relation Extraction.

Authors:  Xiaoyong Liu; Hui Fu; Zhiguo Du
Journal:  Comput Intell Neurosci       Date:  2016-03-22

6.  Identifying genotype-phenotype relationships in biomedical text.

Authors:  Maryam Khordad; Robert E Mercer
Journal:  J Biomed Semantics       Date:  2017-12-06
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.