
Improved identification of noun phrases in clinical radiology reports using a high-performance statistical natural language parser augmented with the UMLS specialist lexicon.

Yang Huang, Henry J Lowe, Dan Klein, Russell J Cucina.

Abstract

OBJECTIVE: The aim of this study was to develop and evaluate a method of extracting noun phrases with full phrase structures from a set of clinical radiology reports using natural language processing (NLP) and to investigate the effects of using the UMLS® Specialist Lexicon to improve noun phrase identification within clinical radiology documents.
DESIGN: The noun phrase identification (NPI) module is composed of a sentence boundary detector, a statistical natural language parser trained on a nonmedical domain, and a noun phrase (NP) tagger. The NPI module processed a set of 100 XML-represented clinical radiology reports in Health Level 7 (HL7)® Clinical Document Architecture (CDA)-compatible format. Computed output was compared with manual markups made by four physicians and one author for maximal (longest) NPs and those made by one author for base (simple) NPs, respectively. An extended lexicon of biomedical terms was created from the UMLS Specialist Lexicon and used to improve NPI performance.
RESULTS: The test set was 50 randomly selected reports. The sentence boundary detector achieved 99.0% precision and 98.6% recall. The overall maximal NPI precision and recall were 78.9% and 81.5% before using the UMLS Specialist Lexicon and 82.1% and 84.6% after. The overall base NPI precision and recall were 88.2% and 86.8% before using the UMLS Specialist Lexicon and 93.1% and 92.6% after, reducing false positives by 31.1% and false negatives by 34.3%.
CONCLUSION: The sentence boundary detector performs excellently. After the adaptation using the UMLS Specialist Lexicon, the statistical parser's NPI performance on radiology reports increased to levels comparable to the parser's native performance in its newswire training domain and to that reported by other researchers in the general nonmedical domain.
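
The distinction between maximal and base NPs, and the way precision and recall are scored against manual markup, can be illustrated with a minimal sketch. This is not the authors' implementation: the bracketed parse, the sentence, the gold spans, and the function names (`parse_brackets`, `np_spans`, `precision_recall`) are all assumptions made for illustration, and exact span matching is assumed as the scoring criterion, which the abstract does not specify.

```python
# Hypothetical parser output for one sentence, in Penn-Treebank-style brackets.
PARSE = (
    "(S (NP (DT The) (NN chest) (NN radiograph)) "
    "(VP (VBZ shows) (NP (NP (DT a) (JJ small) (NN nodule)) "
    "(PP (IN in) (NP (DT the) (JJ right) (JJ upper) (NN lobe))))))"
)


def parse_brackets(text):
    """Parse a bracketed string into nested (label, children) tuples.

    Leaves are plain token strings.
    """
    tokens = text.replace("(", " ( ").replace(")", " ) ").split()
    pos = 0

    def read():
        nonlocal pos
        label = tokens[pos + 1]  # tokens[pos] is the opening "("
        pos += 2
        children = []
        while tokens[pos] != ")":
            if tokens[pos] == "(":
                children.append(read())
            else:
                children.append(tokens[pos])
                pos += 1
        pos += 1  # consume the closing ")"
        return (label, children)

    return read()


def np_spans(tree):
    """Return (maximal, base) NP spans as sets of (start, end) token offsets.

    A maximal NP has no NP ancestor; a base NP has no NP descendant.
    """
    maximal, base = set(), set()

    def walk(node, start, inside_np):
        # Returns (end offset, True if this subtree contains an NP node).
        if isinstance(node, str):
            return start + 1, False
        label, children = node
        is_np = label == "NP"
        end, has_np_below = start, False
        for child in children:
            end, child_has_np = walk(child, end, inside_np or is_np)
            has_np_below = has_np_below or child_has_np
        if is_np and not inside_np:
            maximal.add((start, end))
        if is_np and not has_np_below:
            base.add((start, end))
        return end, is_np or has_np_below

    walk(tree, 0, False)
    return maximal, base


def precision_recall(predicted, gold):
    """Score predicted spans against gold spans using exact span matches."""
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


maximal, base = np_spans(parse_brackets(PARSE))

# Hypothetical manual markup in which the annotator bracketed
# "right upper lobe" without the determiner.
gold_base = {(0, 3), (4, 7), (9, 12)}
precision, recall = precision_recall(base, gold_base)
print(sorted(maximal), sorted(base), precision, recall)
```

In this toy sentence, "a small nodule in the right upper lobe" is a single maximal NP that contains two base NPs, which is why the two evaluations in the abstract are reported separately.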

Year:  2005        PMID: 15684131      PMCID: PMC1090458          DOI: 10.1197/jamia.M1695

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


