Bill Riedl, Nhan Than, Michael Hogarth.
Abstract
Cause of death data is an invaluable resource for shaping our understanding of population health. Mortality statistics are one of the principal sources of health information and, in many countries, the most reliable source of health data [1]. A quick classification process for this data can significantly improve public health efforts. Currently, cause of death data is captured in unstructured form and takes months to process. We think this process can be automated, at least partially, using simple statistical Natural Language Processing (NLP) techniques and the Unified Medical Language System (UMLS) as a vocabulary resource. A system, Medical Match Master (MMM), was built to test this theory. We evaluate this simple NLP approach in the classification of causes of death. The technique performed well when we used a large biomedical vocabulary and applied certain syntactic maneuvers made possible by textual relationships within the vocabulary.
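The abstract does not describe how MMM matches text to vocabulary entries, so the following is only a minimal sketch of the general idea of classifying free-text causes of death against a vocabulary. It assumes a token-overlap (Jaccard) score, a tiny hand-written vocabulary standing in for UMLS, and hypothetical names (`VOCABULARY`, `classify`, `threshold`); none of these come from the paper.

```python
import re

# Hypothetical miniature vocabulary: concept label -> known surface strings.
# A real system would draw these from UMLS; the entries here are illustrative only.
VOCABULARY = {
    "myocardial infarction": ["myocardial infarction", "heart attack"],
    "pneumonia": ["pneumonia", "lung infection"],
    "cerebrovascular accident": ["cerebrovascular accident", "stroke"],
}


def tokens(text):
    """Lowercase the text and return its set of alphabetic tokens."""
    return set(re.findall(r"[a-z]+", text.lower()))


def jaccard(a, b):
    """Jaccard similarity of two token sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def classify(cause_text, vocabulary=VOCABULARY, threshold=0.3):
    """Return (concept, score) for the best-matching concept, or (None, 0.0).

    The threshold is an assumed cutoff below which the text is left unclassified.
    """
    query = tokens(cause_text)
    best_concept, best_score = None, 0.0
    for concept, surface_forms in vocabulary.items():
        for form in surface_forms:
            score = jaccard(query, tokens(form))
            if score > best_score:
                best_concept, best_score = concept, score
    if best_score < threshold:
        return None, 0.0
    return best_concept, best_score
```

For example, `classify("Acute heart attack")` matches the synonym "heart attack" and returns the concept "myocardial infarction", while text with no overlapping tokens is left unclassified. Listing synonyms per concept is one simple way a vocabulary's textual relationships could widen the match; whether MMM does this is not stated in the abstract.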
Year: 2010 PMID: 21347064 PMCID: PMC3041359
Source DB: PubMed Journal: AMIA Annu Symp Proc ISSN: 1559-4076