| Literature DB >> 26958228 |
Jose A Miñarro-Giménez1, Markus Kreuzthaler1, Stefan Schulz1.
Abstract
The identification of relevant predicates between co-occurring concepts in scientific literature databases like MEDLINE is crucial for using these sources for knowledge extraction, in order to obtain meaningful biomedical predications as subject-predicate-object triples. We consider the manually assigned MeSH indexing terms (main headings and subheadings) in MEDLINE records as a rich resource for extracting a broad range of domain knowledge. In this paper, we explore the combination of a clustering method for co-occurring concepts based on their related MeSH subheadings in MEDLINE with the use of SemRep, a natural language processing engine, which extracts predications from free text documents. As a result, we generated sets of clusters of co-occurring concepts and identified the most significant predicates for each cluster. The association of such predicates with the co-occurrences of the resulting clusters produces the list of predications, which were checked for relevance.Mesh:
Year: 2015 PMID: 26958228 PMCID: PMC4765595
Source DB: PubMed Journal: AMIA Annu Symp Proc ISSN: 1559-4076