Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Really, is medical sublanguage that different? Experimental counter-evidence from tagging medical and newspaper corpora.

Literature DB >> 15360875

Really, is medical sublanguage that different? Experimental counter-evidence from tagging medical and newspaper corpora.

Abstract

We compare the performance of two part-of-speech taggers trained on a German newspaper corpus for mixed types of medical documents. TnT, a tagger based on a statistical language model, outperforms Brill's rule-based tagger, and supplied with additional lexicon resources matches state-of-the-art performance figures (close to 97% accuracy) on the medical corpus. We explain this unexpected result by focusing on the statistically significant part-of-speech type overlap between the newspaper training set and the medical test set. At least at that level, sublanguage differences seem to vanish. Thus, statistical off-the-shelf part-of-speech taggers can immediately be reused for medical language processing

Entities: Species

Mesh：

Year: 2004 PMID： 15360875

Source DB: PubMed Journal: Stud Health Technol Inform ISSN： 0926-9630

Keyword Cloud
Cited

2 in total

1. Towards semantic role labeling & IE in the medical literature.

Authors: Yacov Kogan; Nigel Collier; Serguei Pakhomov; Michael Krauthammer
Journal: AMIA Annu Symp Proc Date: 2005

Review 2. What can natural language processing do for clinical decision support?

Authors: Dina Demner-Fushman; Wendy W Chapman; Clement J McDonald
Journal: J Biomed Inform Date: 2009-08-13 Impact factor: 6.317

2 in total