Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Cue-based assertion classification for Swedish clinical text--developing a lexicon for pyConTextSwe.

Literature DB >> 24556644

Cue-based assertion classification for Swedish clinical text--developing a lexicon for pyConTextSwe.

Sumithra Velupillai¹, Maria Skeppstedt², Maria Kvist³, Danielle Mowery⁴, Brian E Chapman⁵, Hercules Dalianis⁶, Wendy W Chapman⁷.

Abstract

OBJECTIVE: The ability of a cue-based system to accurately assert whether a disorder is affirmed, negated, or uncertain is dependent, in part, on its cue lexicon. In this paper, we continue our study of porting an assertion system (pyConTextNLP) from English to Swedish (pyConTextSwe) by creating an optimized assertion lexicon for clinical Swedish. METHODS AND MATERIAL: We integrated cues from four external lexicons, along with generated inflections and combinations. We used subsets of a clinical corpus in Swedish. We applied four assertion classes (definite existence, probable existence, probable negated existence and definite negated existence) and two binary classes (existence yes/no and uncertainty yes/no) to pyConTextSwe. We compared pyConTextSwe's performance with and without the added cues on a development set, and improved the lexicon further after an error analysis. On a separate evaluation set, we calculated the system's final performance.
RESULTS: Following integration steps, we added 454 cues to pyConTextSwe. The optimized lexicon developed after an error analysis resulted in statistically significant improvements on the development set (83% F-score, overall). The system's final F-scores on an evaluation set were 81% (overall). For the individual assertion classes, F-score results were 88% (definite existence), 81% (probable existence), 55% (probable negated existence), and 63% (definite negated existence). For the binary classifications existence yes/no and uncertainty yes/no, final system performance was 97%/87% and 78%/86% F-score, respectively.
CONCLUSIONS: We have successfully ported pyConTextNLP to Swedish (pyConTextSwe). We have created an extensive and useful assertion lexicon for Swedish clinical text, which could form a valuable resource for similar studies, and which is publicly available.

Entities: Chemical Disease Gene Species

Keywords: Assertion classification; Clinical text mining; Dictionaries; Electronic health records; Information extraction; Medical Language Processing

Mesh：

Year: 2014 PMID： 24556644 PMCID： PMC4104142 DOI： 10.1016/j.artmed.2014.01.001

Source DB: PubMed Journal: Artif Intell Med ISSN： 0933-3657 Impact factor: 5.326

14 in total

1. Ad hoc classification of radiology reports.

Authors: D B Aronow; F Fangfang; W B Croft
Journal: J Am Med Inform Assoc Date: 1999 Sep-Oct Impact factor: 4.497

2. Use of general-purpose negation detection to augment concept indexing of medical documents: a quantitative study using the UMLS.

Authors: P G Mutalik; A Deshpande; P M Nadkarni
Journal: J Am Med Inform Assoc Date: 2001 Nov-Dec Impact factor: 4.497

Cue-based assertion classification for Swedish clinical text--developing a lexicon for pyConTextSwe.

1. Ad hoc classification of radiology reports.

2. Use of general-purpose negation detection to augment concept indexing of medical documents: a quantitative study using the UMLS.

3. MITRE system for clinical assertion status classification.

4. Biomedical negation scope detection with conditional random fields.

5. Machine learning and rule-based approaches to assertion classification.

6. Document-level classification of CT pulmonary angiography reports based on an extension of the ConText algorithm.

7. A general natural-language text processor for clinical radiology.

8. ConText: an algorithm for determining negation, experiencer, and temporal status from clinical reports.

9. Negation detection in Swedish clinical text: An adaption of NegEx to Swedish.

10. Machine-learned solutions for three stages of clinical information extraction: the state of the art at i2b2 2010.

Review 1. Clinical Natural Language Processing in 2014: Foundational Methods Supporting Efficient Healthcare.

Review 2. Recent Advances in Clinical Natural Language Processing in Support of Semantic Analysis.

3. Contextual property detection in Dutch diagnosis descriptions for uncertainty, laterality and temporality.

4. Negation and uncertainty detection in clinical texts written in Spanish: a deep learning-based approach.

Review 5. Clinical Natural Language Processing in languages other than English: opportunities and challenges.