Literature DB >> 26958228

Knowledge Extraction from MEDLINE by Combining Clustering with Natural Language Processing.

Jose A Miñarro-Giménez1, Markus Kreuzthaler1, Stefan Schulz1.   

Abstract

The identification of relevant predicates between co-occurring concepts in scientific literature databases like MEDLINE is crucial for using these sources for knowledge extraction, in order to obtain meaningful biomedical predications as subject-predicate-object triples. We consider the manually assigned MeSH indexing terms (main headings and subheadings) in MEDLINE records as a rich resource for extracting a broad range of domain knowledge. In this paper, we explore the combination of a clustering method for co-occurring concepts based on their related MeSH subheadings in MEDLINE with the use of SemRep, a natural language processing engine, which extracts predications from free text documents. As a result, we generated sets of clusters of co-occurring concepts and identified the most significant predicates for each cluster. The association of such predicates with the co-occurrences of the resulting clusters produces the list of predications, which were checked for relevance.

Mesh:

Year:  2015        PMID: 26958228      PMCID: PMC4765595     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  11 in total

1.  Automated knowledge extraction from MEDLINE citations.

Authors:  E A Mendonça; J J Cimino
Journal:  Proc AMIA Symp       Date:  2000

2.  The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text.

Authors:  Thomas C Rindflesch; Marcelo Fiszman
Journal:  J Biomed Inform       Date:  2003-12       Impact factor: 6.317

3.  The Unified Medical Language System (UMLS): integrating biomedical terminology.

Authors:  Olivier Bodenreider
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

4.  Distilling conceptual connections from MeSH co-occurrences.

Authors:  Padmini Srinivasan; Dimitar Hristovski
Journal:  Stud Health Technol Inform       Date:  2004

5.  Knowledge discovery by automated identification and ranking of implicit relationships.

Authors:  Jonathan D Wren; Raffi Bekeredjian; Jelena A Stewart; Ralph V Shohet; Harold R Garner
Journal:  Bioinformatics       Date:  2004-01-22       Impact factor: 6.937

6.  Non-lexical approaches to identifying associative relations in the gene ontology.

Authors:  Olivier Bodenreider; Marc Aubry; Anita Burgun
Journal:  Pac Symp Biocomput       Date:  2005

7.  Genestrace: phenomic knowledge discovery via structured terminology.

Authors:  Michael N Cantor; Indra Neil Sarkar; Olivier Bodenreider; Yves A Lussier
Journal:  Pac Symp Biocomput       Date:  2005

Review 8.  A survey of current work in biomedical text mining.

Authors:  Aaron M Cohen; William R Hersh
Journal:  Brief Bioinform       Date:  2005-03       Impact factor: 11.622

9.  Large-scale directional relationship extraction and resolution.

Authors:  Cory B Giles; Jonathan D Wren
Journal:  BMC Bioinformatics       Date:  2008-08-12       Impact factor: 3.169

Review 10.  Applications of the MapReduce programming framework to clinical big data analysis: current landscape and future trends.

Authors:  Emad A Mohammed; Behrouz H Far; Christopher Naugler
Journal:  BioData Min       Date:  2014-10-29       Impact factor: 2.522

View more
  1 in total

1.  Creation of Individual Scientific Concept-Centered Semantic Maps Based on Automated Text-Mining Analysis of PubMed.

Authors:  Ekaterina Ilgisonis; Andrey Lisitsa; Valerya Kudryavtseva; Elena Ponomarenko
Journal:  Adv Bioinformatics       Date:  2018-07-26
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.