| Literature DB >> 9357693 |
A Nazarenko1, P Zweigenbaum, J Bouaud, B Habert.
Abstract
Medical Language Processing (MLP), especially in specific domains, requires fine-grained semantic lexica. We examine whether robust natural language processing tools used on a representative corpus of a domain help in building and refining a semantic categorization. We test this hypothesis with ZELLIG, a corpus analysis tool. The first clusters we obtain are consistent with a model of the domain, as found in the SNOMED nomenclature. They correspond to coarse-grained semantic categories, but isolate as well lexical idiosyncrasies belonging to the clinical sub-language. Moreover, they help categorize additional words.Mesh:
Year: 1997 PMID: 9357693 PMCID: PMC2233482
Source DB: PubMed Journal: Proc AMIA Annu Fall Symp ISSN: 1091-8280