Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Trustworthy assertion classification through prompting.

Literature DB >> 35811026

Trustworthy assertion classification through prompting.

Song Wang¹, Liyan Tang¹, Akash Majety², Justin F Rousseau³, George Shih⁴, Ying Ding¹, Yifan Peng⁵.

Abstract

Accurate identification of the presence, absence or possibility of relevant entities in clinical notes is important for healthcare professionals to quickly understand crucial clinical information. This introduces the task of assertion classification - to correctly identify the assertion status of an entity in the unstructured clinical notes. Recent rule-based and machine-learning approaches suffer from labor-intensive pattern engineering and severe class bias toward majority classes. To solve this problem, in this study, we propose a prompt-based learning approach, which treats the assertion classification task as a masked language auto-completion problem. We evaluated the model on six datasets. Our prompt-based method achieved a micro-averaged F-1 of 0.954 on the i2b2 2010 assertion dataset, with ∼1.8% improvements over previous works. In particular, our model showed excellence in detecting classes with few instances (few-shot). Evaluations on five external datasets showcase the outstanding generalizability of the prompt-based method to unseen data. To examine the rationality of our model, we further introduced two rationale faithfulness metrics: comprehensiveness and sufficiency. The results reveal that compared to the "pre-train, fine-tune" procedure, our prompt-based model has a stronger capability of identifying the comprehensive (∼63.93%) and sufficient (∼11.75%) linguistic features from free text. We further evaluated the model-agnostic explanations using LIME. The results imply a better rationale agreement between our model and human beings (∼71.93% in average F-1), which demonstrates the superior trustworthiness of our model.

Entities: Chemical

Keywords: Concept assertion; Deep learning; NLP; Prompt-based learning

Mesh：

Year: 2022 PMID： 35811026 PMCID： PMC9378721 DOI： 10.1016/j.jbi.2022.104139

Source DB: PubMed Journal: J Biomed Inform ISSN： 1532-0464 Impact factor: 8.000

Keyword Cloud
References

15 in total

1. Evaluation of negation phrases in narrative clinical reports.

Authors: W W Chapman; W Bridewell; P Hanbury; G F Cooper; B G Buchanan
Journal: Proc AMIA Symp Date: 2001

2. A simple algorithm for identifying negated findings and diseases in discharge summaries.

Authors: W W Chapman; W Bridewell; P Hanbury; G F Cooper; B G Buchanan
Journal: J Biomed Inform Date: 2001-10 Impact factor: 6.317

3. 2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.

Authors: Özlem Uzuner; Brett R South; Shuying Shen; Scott L DuVall
Journal: J Am Med Inform Assoc Date: 2011-06-16 Impact factor: 4.497

4. A flexible framework for deriving assertions from electronic medical records.

Authors: Kirk Roberts; Sanda M Harabagiu
Journal: J Am Med Inform Assoc Date: 2011-07-01 Impact factor: 4.497

5. Building gold standard corpora for medical natural language processing tasks.

Authors: Louise Deleger; Qi Li; Todd Lingren; Megan Kaiser; Katalin Molnar; Laura Stoutenborough; Michal Kouril; Keith Marsolo; Imre Solti
Journal: AMIA Annu Symp Proc Date: 2012-11-03

6. The Role of a Deep-Learning Method for Negation Detection in Patient Cohort Identification from Electroencephalography Reports.

Authors: Stuart J Taylor; Sanda M Harabagiu
Journal: AMIA Annu Symp Proc Date: 2018-12-05

Trustworthy assertion classification through prompting.

1. Evaluation of negation phrases in narrative clinical reports.

2. A simple algorithm for identifying negated findings and diseases in discharge summaries.

3. 2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.

4. A flexible framework for deriving assertions from electronic medical records.

5. Building gold standard corpora for medical natural language processing tasks.

6. The Role of a Deep-Learning Method for Negation Detection in Patient Cohort Identification from Electroencephalography Reports.

7. Launching into clinical space with medspaCy: a new clinical text processing toolkit in Python.

Review 8. Evaluating temporal relations in clinical text: 2012 i2b2 Challenge.

9. Machine-learned solutions for three stages of clinical information extraction: the state of the art at i2b2 2010.

10. NegBio: a high-performance tool for negation and uncertainty detection in radiology reports.