Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A scoping review of publicly available language tasks in clinical natural language processing.

Literature DB >> 35923088

A scoping review of publicly available language tasks in clinical natural language processing.

Yanjun Gao¹, Dmitriy Dligach², Leslie Christensen³, Samuel Tesch³, Ryan Laffin³, Dongfang Xu⁴, Timothy Miller⁴, Ozlem Uzuner⁵, Matthew M Churpek¹, Majid Afshar¹.

Abstract

OBJECTIVE: To provide a scoping review of papers on clinical natural language processing (NLP) shared tasks that use publicly available electronic health record data from a cohort of patients.
MATERIALS AND METHODS: We searched 6 databases, including biomedical research and computer science literature databases. A round of title/abstract screening and full-text screening were conducted by 2 reviewers. Our method followed the PRISMA-ScR guidelines.
RESULTS: A total of 35 papers with 48 clinical NLP tasks met inclusion criteria between 2007 and 2021. We categorized the tasks by the type of NLP problems, including named entity recognition, summarization, and other NLP tasks. Some tasks were introduced as potential clinical decision support applications, such as substance abuse detection, and phenotyping. We summarized the tasks by publication venue and dataset type. DISCUSSION: The breadth of clinical NLP tasks continues to grow as the field of NLP evolves with advancements in language systems. However, gaps exist with divergent interests between the general domain NLP community and the clinical informatics community for task motivation and design, and in generalizability of the data sources. We also identified issues in data preparation.
CONCLUSION: The existing clinical NLP tasks cover a wide range of topics and the field is expected to grow and attract more attention from both general domain NLP and clinical informatics community. We encourage future work to incorporate multidisciplinary collaboration, reporting transparency, and standardization in data preparation. We provide a listing of all the shared task papers and datasets from this review in a GitLab repository.

Entities: Chemical

Keywords: clinical decision support; clinical informatics; electronic health records; natural language processing; systematic review

Mesh：

Year: 2022 PMID： 35923088 PMCID： PMC9471718 DOI： 10.1093/jamia/ocac127

Source DB: PubMed Journal: J Am Med Inform Assoc ISSN： 1067-5027 Impact factor: 7.942

28 in total

Review 1. Identifying risk factors for heart disease over time: Overview of 2014 i2b2/UTHealth shared task Track 2.

Authors: Amber Stubbs; Christopher Kotfila; Hua Xu; Özlem Uzuner
Journal: J Biomed Inform Date: 2015-07-22 Impact factor: 6.317

2. Annotating longitudinal clinical narratives for de-identification: The 2014 i2b2/UTHealth corpus.

Authors: Amber Stubbs; Özlem Uzuner
Journal: J Biomed Inform Date: 2015-08-28 Impact factor: 6.317

3. 2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.

Authors: Özlem Uzuner; Brett R South; Shuying Shen; Scott L DuVall
Journal: J Am Med Inform Assoc Date: 2011-06-16 Impact factor: 4.497

4. Overcoming barriers to NLP for clinical text: the role of shared tasks and the need for additional creative solutions.

Authors: Wendy W Chapman; Prakash M Nadkarni; Lynette Hirschman; Leonard W D'Avolio; Guergana K Savova; Ozlem Uzuner
Journal: J Am Med Inform Assoc Date: 2011 Sep-Oct Impact factor: 4.497

Review 5. Symptom severity prediction from neuropsychiatric clinical records: Overview of 2016 CEGS N-GRID shared tasks Track 2.

Authors: Michele Filannino; Amber Stubbs; Özlem Uzuner
Journal: J Biomed Inform Date: 2017-04-25 Impact factor: 6.317

10. Family History Extraction From Synthetic Clinical Narratives Using Natural Language Processing: Overview and Evaluation of a Challenge Data Set and Solutions for the 2019 National NLP Clinical Challenges (n2c2)/Open Health Natural Language Processing (OHNLP) Competition.

Authors: Feichen Shen; Sijia Liu; Sunyang Fu; Yanshan Wang; Sam Henry; Ozlem Uzuner; Hongfang Liu
Journal: JMIR Med Inform Date: 2021-01-27

1 in total

1. Research synthesis as a strategy for advancing biomedical and health informatics knowledge.

Authors: Suzanne Bakken
Journal: J Am Med Inform Assoc Date: 2022-09-12 Impact factor: 7.942

1 in total

A scoping review of publicly available language tasks in clinical natural language processing.

Review 1. Identifying risk factors for heart disease over time: Overview of 2014 i2b2/UTHealth shared task Track 2.

2. Annotating longitudinal clinical narratives for de-identification: The 2014 i2b2/UTHealth corpus.

3. 2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.

4. Overcoming barriers to NLP for clinical text: the role of shared tasks and the need for additional creative solutions.

Review 5. Symptom severity prediction from neuropsychiatric clinical records: Overview of 2016 CEGS N-GRID shared tasks Track 2.

6. Identifying patient smoking status from medical discharge records.

Review 7. Evaluating temporal relations in clinical text: 2012 i2b2 Challenge.

8. Evaluating the state of the art in disorder recognition and normalization of the clinical narrative.

9. Rayyan-a web and mobile app for systematic reviews.

10. Family History Extraction From Synthetic Clinical Narratives Using Natural Language Processing: Overview and Evaluation of a Challenge Data Set and Solutions for the 2019 National NLP Clinical Challenges (n2c2)/Open Health Natural Language Processing (OHNLP) Competition.

1. Research synthesis as a strategy for advancing biomedical and health informatics knowledge.