| Literature DB >> 18999169 |
Rong Xu1, Kaustubh Supekar, Alex Morgan, Amar Das, Alan Garber.
Abstract
Concept specific lexicons (e.g. diseases, drugs, anatomy) are a critical source of background knowledge for many medical language-processing systems. However, the rapid pace of biomedical research and the lack of constraints on usage ensure that such dictionaries are incomplete. Focusing on disease terminology, we have developed an automated, unsupervised, iterative pattern learning approach for constructing a comprehensive medical dictionary of disease terms from randomized clinical trial (RCT) abstracts, and we compared different ranking methods for automatically extracting con-textual patterns and concept terms. When used to identify disease concepts from 100 randomly chosen, manually annotated clinical abstracts, our disease dictionary shows significant performance improvement (F1 increased by 35-88%) over available, manually created disease terminologies.Mesh:
Year: 2008 PMID: 18999169 PMCID: PMC2656087
Source DB: PubMed Journal: AMIA Annu Symp Proc ISSN: 1559-4076