Domain adaptation for semantic role labeling of clinical text.

Yaoyun Zhang, Buzhou Tang, Min Jiang, Jingqi Wang, Hua Xu.

Abstract

OBJECTIVE: Semantic role labeling (SRL), which extracts a shallow semantic relation representation from different surface textual forms of free text sentences, is important for understanding natural language. Few studies in SRL have been conducted in the medical domain, primarily due to the lack of annotated clinical SRL corpora, which are time-consuming and costly to build. The goal of this study is to investigate domain adaptation techniques for clinical SRL that leverage resources built from newswire and biomedical literature to improve performance and save annotation costs.
MATERIALS AND METHODS: Multisource Integrated Platform for Answering Clinical Questions (MiPACQ), a manually annotated SRL clinical corpus, was used as the target domain dataset. PropBank and NomBank from newswire and BioProp from biomedical literature were used as source domain datasets. Three state-of-the-art domain adaptation algorithms were employed: instance pruning, transfer self-training, and feature augmentation. The SRL performance using different domain adaptation algorithms was evaluated by using 10-fold cross-validation on the MiPACQ corpus. Learning curves for the different methods were generated to assess the effect of sample size.
RESULTS AND CONCLUSION: When all three source domain corpora were used, the feature augmentation algorithm achieved a statistically significantly higher F-measure (83.18%) than the baseline using the MiPACQ dataset alone (F-measure, 81.53%), indicating that domain adaptation algorithms may improve SRL performance on clinical text. To achieve performance comparable to the baseline method that used 90% of MiPACQ training samples, the feature augmentation algorithm required <50% of the training samples in MiPACQ, demonstrating that annotation costs of clinical SRL can be reduced significantly by leveraging existing SRL resources from other domains.
© The Author 2015. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
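
The abstract names feature augmentation as the best-performing algorithm but does not spell out its formulation. The following is a minimal sketch that assumes the commonly used variant, in which every feature is copied into a shared version and a domain-specific version, so a single classifier can learn both cross-domain and domain-specific weights. The function name `augment`, the `general::` prefix, the domain labels, and the toy features are all hypothetical and are not taken from the paper.

```python
from typing import Dict


def augment(features: Dict[str, float], domain: str) -> Dict[str, float]:
    """Duplicate each feature into a shared copy and a domain-specific copy.

    Assumption: this follows the usual feature-augmentation recipe for
    domain adaptation; the abstract does not confirm the exact variant.
    """
    augmented: Dict[str, float] = {}
    for name, value in features.items():
        augmented[f"general::{name}"] = value    # shared across all domains
        augmented[f"{domain}::{name}"] = value   # visible only to this domain
    return augmented


# Toy SRL-style features for one candidate argument (hypothetical names).
raw = {"predicate=treat": 1.0, "head_pos=NN": 1.0}

source_example = augment(raw, domain="newswire")   # e.g. a PropBank instance
target_example = augment(raw, domain="clinical")   # e.g. a MiPACQ instance

# Only the "general::" copies overlap, which is what lets source-domain
# training data inform predictions on the target domain.
shared_keys = set(source_example) & set(target_example)
print(sorted(shared_keys))
```

Under this assumption, source and target instances are pooled into one training set after augmentation, and the learner decides per feature how much weight to place on the shared copy versus the clinical-specific copy.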

Keywords:  clinical natural language processing; domain adaptation; semantic role labeling; shallow semantic parsing; transfer learning

Year:  2015        PMID: 26063745      PMCID: PMC4986662          DOI: 10.1093/jamia/ocu048

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497



1.  Leveraging existing corpora for de-identification of psychiatric notes using domain adaptation.

Authors:  Hee-Jin Lee; Yaoyun Zhang; Kirk Roberts; Hua Xu
Journal:  AMIA Annu Symp Proc       Date:  2018-04-16

2.  Clinical Natural Language Processing in 2015: Leveraging the Variety of Texts of Clinical Interest.

Authors:  A Névéol; P Zweigenbaum
Journal:  Yearb Med Inform       Date:  2016-11-10

3.  A hybrid approach to automatic de-identification of psychiatric notes.

Authors:  Hee-Jin Lee; Yonghui Wu; Yaoyun Zhang; Jun Xu; Hua Xu; Kirk Roberts
Journal:  J Biomed Inform       Date:  2017-06-07       Impact factor: 6.317

4.  Defining Phenotypes from Clinical Data to Drive Genomic Research.

Authors:  Jamie R Robinson; Wei-Qi Wei; Dan M Roden; Joshua C Denny
Journal:  Annu Rev Biomed Data Sci       Date:  2018-04-25

5.  Adapting Word Embeddings from Multiple Domains to Symptom Recognition from Psychiatric Notes.

Authors:  Yaoyun Zhang; Hee-Jin Li; Jingqi Wang; Trevor Cohen; Kirk Roberts; Hua Xu
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2018-05-18

6.  Ranking Medical Terms to Support Expansion of Lay Language Resources for Patient Comprehension of Electronic Health Record Notes: Adapted Distant Supervision Approach.

Authors:  Jinying Chen; Abhyuday N Jagannatha; Samah J Fodeh; Hong Yu
Journal:  JMIR Med Inform       Date:  2017-10-31

7.  FasTag: Automatic text classification of unstructured medical narratives.

Authors:  Guhan Ram Venkataraman; Arturo Lopez Pineda; Oliver J Bear Don't Walk Iv; Ashley M Zehnder; Sandeep Ayyar; Rodney L Page; Carlos D Bustamante; Manuel A Rivas
Journal:  PLoS One       Date:  2020-06-22       Impact factor: 3.240

8.  CogStack - experiences of deploying integrated information retrieval and extraction services in a large National Health Service Foundation Trust hospital.

Authors:  Richard Jackson; Ismail Kartoglu; Clive Stringer; Genevieve Gorrell; Angus Roberts; Xingyi Song; Honghan Wu; Asha Agrawal; Kenneth Lui; Tudor Groza; Damian Lewsley; Doug Northwood; Amos Folarin; Robert Stewart; Richard Dobson
Journal:  BMC Med Inform Decis Mak       Date:  2018-06-25       Impact factor: 2.796

9.  A bibliometric analysis of natural language processing in medical research.

Authors:  Xieling Chen; Haoran Xie; Fu Lee Wang; Ziqing Liu; Juan Xu; Tianyong Hao
Journal:  BMC Med Inform Decis Mak       Date:  2018-03-22       Impact factor: 2.796
