Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Comparing syntactic complexity in medical and non-medical corpora.

Literature DB >> 11825160

Comparing syntactic complexity in medical and non-medical corpora.

Abstract

With the growing use of Natural Language Processing (NLP) techniques as solutions in Medical Informatics, the need to quickly and efficiently create the knowledge structures used by these systems has grown concurrently. Automatic discovery of a lexicon for use by an NLP system through machine learning will require information about the syntax of medical language. Understanding the syntactic differences between medical and non-medical corpora may allow more efficient acquisition of a lexicon. Three experiments designed to quantify the syntactic differences in medical and non-medical corpora were conducted. The results show that the syntax of medical language shows less variation than non-medical language and is likely simpler. The differences were great enough to question the applicability of general language tools on medical language. These differences may reduce the difficulty of some free text machine learning problems by capitalizing on the simpler nature of narrative medical syntax.

Mesh：

Year: 2001 PMID： 11825160 PMCID： PMC2243419

Source DB: PubMed Journal: Proc AMIA Symp ISSN： 1531-605X

4 in total

1. Natural language processing and its future in medicine.

Authors: C Friedman; G Hripcsak
Journal: Acad Med Date: 1999-08 Impact factor: 6.893

2. Medical text representations for inductive learning.

Authors: A Wilcox; G Hripcsak
Journal: Proc AMIA Symp Date: 2000

3. A natural language parsing system for encoding admitting diagnoses.

Authors: P J Haug; L Christensen; M Gundersen; B Clemons; S Koehler; K Bauer
Journal: Proc AMIA Annu Fall Symp Date: 1997

4. Towards a comprehensive medical language processing system: methods and issues.

Authors: C Friedman
Journal: Proc AMIA Annu Fall Symp Date: 1997

4 in total

5 in total

1. Document clustering of clinical narratives: a systematic study of clinical sublanguages.

Authors: Olga Patterson; John F Hurdle
Journal: AMIA Annu Symp Proc Date: 2011-10-22

2. Heuristic sample selection to minimize reference standard training set for a part-of-speech tagger.

Authors: Kaihong Liu; Wendy Chapman; Rebecca Hwa; Rebecca S Crowley
Journal: J Am Med Inform Assoc Date: 2007-06-28 Impact factor: 4.497

3. Making texts in electronic health records comprehensible to consumers: a prototype translator.

Authors: Qing Zeng-Treitler; Sergey Goryachev; Hyeoneui Kim; Alla Keselman; Douglas Rosendale
Journal: AMIA Annu Symp Proc Date: 2007-10-11

4. Improving performance of natural language processing part-of-speech tagging on clinical narratives through domain adaptation.

Authors: Jeffrey P Ferraro; Hal Daumé; Scott L Duvall; Wendy W Chapman; Henk Harkema; Peter J Haug
Journal: J Am Med Inform Assoc Date: 2013-03-13 Impact factor: 4.497

5. Document Sublanguage Clustering to Detect Medical Specialty in Cross-institutional Clinical Texts.

Authors: Kristina Doing-Harris; Olga Patterson; Sean Igo; John Hurdle
Journal: Proc ACM Int Workshop Data Text Min Biomed Inform Date: 2013 Oct-Nov

5 in total