Literature DB >> 15360875

Really, is medical sublanguage that different? Experimental counter-evidence from tagging medical and newspaper corpora.

Joachim Wermter1, Udo Hahn.   

Abstract

We compare the performance of two part-of-speech taggers trained on a German newspaper corpus for mixed types of medical documents. TnT, a tagger based on a statistical language model, outperforms Brill's rule-based tagger, and supplied with additional lexicon resources matches state-of-the-art performance figures (close to 97% accuracy) on the medical corpus. We explain this unexpected result by focusing on the statistically significant part-of-speech type overlap between the newspaper training set and the medical test set. At least at that level, sublanguage differences seem to vanish. Thus, statistical off-the-shelf part-of-speech taggers can immediately be reused for medical language processing

Entities:  

Mesh:

Year:  2004        PMID: 15360875

Source DB:  PubMed          Journal:  Stud Health Technol Inform        ISSN: 0926-9630


  2 in total

1.  Towards semantic role labeling & IE in the medical literature.

Authors:  Yacov Kogan; Nigel Collier; Serguei Pakhomov; Michael Krauthammer
Journal:  AMIA Annu Symp Proc       Date:  2005

Review 2.  What can natural language processing do for clinical decision support?

Authors:  Dina Demner-Fushman; Wendy W Chapman; Clement J McDonald
Journal:  J Biomed Inform       Date:  2009-08-13       Impact factor: 6.317

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.