Literature DB >> 18693911

The CLEF corpus: semantic annotation of clinical text.

Angus Roberts1, Robert Gaizauskas, Mark Hepple, Neil Davis, George Demetriou, Yikun Guo, Jay Kola, Ian Roberts, Andrea Setzer, Archana Tapuria, Bill Wheeldin.   

Abstract

The Clinical E-Science Framework (CLEF) project is building a framework for the capture, integration and presentation of clinical information: for clinical research, evidence-based health care and genotype-meets-phenotype informatics. A significant portion of the information required by such a framework originates as text, even in EHR-savvy organizations. CLEF uses Information Extraction (IE) to make this unstructured information available. An important part of IE is the identification of semantic entities and relationships. Typical approaches require human annotated documents to provide both evaluation standards and material for system development. CLEF has a corpus of clinical narratives, histopathology reports and imaging reports from 20 thousand patients. We describe the selection of a subset of this corpus for manual annotation of clinical entities and relationships. We describe an annotation methodology and report encouraging initial results of inter-annotator agreement. Comparisons are made between different text sub-genres, and between annotators with different skills.

Entities:  

Mesh:

Year:  2007        PMID: 18693911      PMCID: PMC2655900     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  4 in total

1.  GENIA corpus--semantically annotated corpus for bio-textmining.

Authors:  J-D Kim; T Ohta; Y Tateisi; J Tsujii
Journal:  Bioinformatics       Date:  2003       Impact factor: 6.937

2.  Agreement, the f-measure, and reliability in information retrieval.

Authors:  George Hripcsak; Adam S Rothschild
Journal:  J Am Med Inform Assoc       Date:  2005-01-31       Impact factor: 4.497

3.  Building and evaluating annotated corpora for medical NLP systems.

Authors:  Philip V Ogren; Guergana Savova; James D Buntrock; Christopher G Chute
Journal:  AMIA Annu Symp Proc       Date:  2006

Review 4.  Natural language processing and the representation of clinical data.

Authors:  N Sager; M Lyman; C Bucknall; N Nhan; L J Tick
Journal:  J Am Med Inform Assoc       Date:  1994 Mar-Apr       Impact factor: 4.497

  4 in total
  23 in total

1.  Qualitative analysis of workflow modifications used to generate the reference standard for the 2010 i2b2/VA challenge.

Authors:  Brett R South; Shuying Shen; Robyn Barrus; Scott L DuVall; Ozlem Uzuner; Charlene Weir
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22

2.  Automated extraction of ejection fraction for quality measurement using regular expressions in Unstructured Information Management Architecture (UIMA) for heart failure.

Authors:  Jennifer H Garvin; Scott L DuVall; Brett R South; Bruce E Bray; Daniel Bolton; Julia Heavirland; Steve Pickard; Paul Heidenreich; Shuying Shen; Charlene Weir; Matthew Samore; Mary K Goldstein
Journal:  J Am Med Inform Assoc       Date:  2012-03-21       Impact factor: 4.497

3.  Textractor: a hybrid system for medications and reason for their prescription extraction from clinical text documents.

Authors:  Stéphane M Meystre; Julien Thibault; Shuying Shen; John F Hurdle; Brett R South
Journal:  J Am Med Inform Assoc       Date:  2010 Sep-Oct       Impact factor: 4.497

4.  Inductive creation of an annotation schema and a reference standard for de-identification of VA electronic clinical notes.

Authors:  Jeanmarie Mayer; Shuying Shen; Brett R South; Stephane Meystre; F Jeff Friedlin; William R Ray; Matthew Samore
Journal:  AMIA Annu Symp Proc       Date:  2009-11-14

5.  Methodology to develop and evaluate a semantic representation for NLP.

Authors:  Jeannie Y Irwin; Henk Harkema; Lee M Christensen; Titus Schleyer; Peter J Haug; Wendy W Chapman
Journal:  AMIA Annu Symp Proc       Date:  2009-11-14

6.  Standardizing Heterogeneous Annotation Corpora Using HL7 FHIR for Facilitating their Reuse and Integration in Clinical NLP.

Authors:  Na Hong; Andrew Wen; Majid Rastegar Mojarad; Sunghwan Sohn; Hongfang Liu; Guoqian Jiang
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

7.  Active Learning-based corpus annotation--the PathoJen experience.

Authors:  Udo Hahn; Elena Beisswanger; Ekaterina Buyko; Erik Faessler
Journal:  AMIA Annu Symp Proc       Date:  2012-11-03

8.  Mining the pharmacogenomics literature--a survey of the state of the art.

Authors:  Udo Hahn; K Bretonnel Cohen; Yael Garten; Nigam H Shah
Journal:  Brief Bioinform       Date:  2012-07       Impact factor: 11.622

9.  Semantic annotation of clinical events for generating a problem list.

Authors:  Danielle L Mowery; Pamela Jordan; Janyce Wiebe; Henk Harkema; John Dowling; Wendy W Chapman
Journal:  AMIA Annu Symp Proc       Date:  2013-11-16

10.  Developing a manually annotated clinical document corpus to identify phenotypic information for inflammatory bowel disease.

Authors:  Brett R South; Shuying Shen; Makoto Jones; Jennifer Garvin; Matthew H Samore; Wendy W Chapman; Adi V Gundlapalli
Journal:  BMC Bioinformatics       Date:  2009-09-17       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.