Literature DB >> 19535011

Building a semantically annotated corpus of clinical texts.

Angus Roberts1, Robert Gaizauskas, Mark Hepple, George Demetriou, Yikun Guo, Ian Roberts, Andrea Setzer.   

Abstract

In this paper, we describe the construction of a semantically annotated corpus of clinical texts for use in the development and evaluation of systems for automatically extracting clinically significant information from the textual component of patient records. The paper details the sampling of textual material from a collection of 20,000 cancer patient records, the development of a semantic annotation scheme, the annotation methodology, the distribution of annotations in the final corpus, and the use of the corpus for development of an adaptive information extraction system. The resulting corpus is the most richly semantically annotated resource for clinical text processing built to date, whose value has been demonstrated through its use in developing an effective information extraction system. The detailed presentation of our corpus construction and annotation methodology will be of value to others seeking to build high-quality semantically annotated corpora in biomedical domains.

Entities:  

Mesh:

Year:  2009        PMID: 19535011     DOI: 10.1016/j.jbi.2008.12.013

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  36 in total

1.  Qualitative analysis of workflow modifications used to generate the reference standard for the 2010 i2b2/VA challenge.

Authors:  Brett R South; Shuying Shen; Robyn Barrus; Scott L DuVall; Ozlem Uzuner; Charlene Weir
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22

2.  A system for coreference resolution for the clinical narrative.

Authors:  Jiaping Zheng; Wendy W Chapman; Timothy A Miller; Chen Lin; Rebecca S Crowley; Guergana K Savova
Journal:  J Am Med Inform Assoc       Date:  2012-01-31       Impact factor: 4.497

3.  Automatic discourse connective detection in biomedical text.

Authors:  Balaji Polepalli Ramesh; Rashmi Prasad; Tim Miller; Brian Harrington; Hong Yu
Journal:  J Am Med Inform Assoc       Date:  2012-06-28       Impact factor: 4.497

Review 4.  Management of Dynamic Biomedical Terminologies: Current Status and Future Challenges.

Authors:  M Da Silveira; J C Dos Reis; C Pruski
Journal:  Yearb Med Inform       Date:  2015-08-13

5.  The Yale cTAKES extensions for document classification: architecture and application.

Authors:  Vijay Garla; Vincent Lo Re; Zachariah Dorey-Stein; Farah Kidwai; Matthew Scotch; Julie Womack; Amy Justice; Cynthia Brandt
Journal:  J Am Med Inform Assoc       Date:  2011-05-27       Impact factor: 4.497

6.  Anaphoric relations in the clinical narrative: corpus creation.

Authors:  Guergana K Savova; Wendy W Chapman; Jiaping Zheng; Rebecca S Crowley
Journal:  J Am Med Inform Assoc       Date:  2011-04-01       Impact factor: 4.497

7.  Expert guided natural language processing using one-class classification.

Authors:  Erel Joffe; Emily J Pettigrew; Jorge R Herskovic; Charles F Bearden; Elmer V Bernstam
Journal:  J Am Med Inform Assoc       Date:  2015-06-10       Impact factor: 4.497

8.  Vaccine adverse event text mining system for extracting features from vaccine safety reports.

Authors:  Taxiarchis Botsis; Thomas Buttolph; Michael D Nguyen; Scott Winiecki; Emily Jane Woo; Robert Ball
Journal:  J Am Med Inform Assoc       Date:  2012-08-25       Impact factor: 4.497

9.  Building gold standard corpora for medical natural language processing tasks.

Authors:  Louise Deleger; Qi Li; Todd Lingren; Megan Kaiser; Katalin Molnar; Laura Stoutenborough; Michal Kouril; Keith Marsolo; Imre Solti
Journal:  AMIA Annu Symp Proc       Date:  2012-11-03

10.  Active Learning-based corpus annotation--the PathoJen experience.

Authors:  Udo Hahn; Elena Beisswanger; Ekaterina Buyko; Erik Faessler
Journal:  AMIA Annu Symp Proc       Date:  2012-11-03
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.