Literature DB >> 9357692

Assessing the feasibility of large-scale natural language processing in a corpus of ordinary medical records: a lexical analysis.

W R Hersh1, E M Campbell, S E Malveau.   

Abstract

OBJECTIVE: Identify the lexical content of a large corpus of ordinary medical records to assess the feasibility of large-scale natural language processing.
METHODS: A corpus of 560 megabytes of medical record text from an academic medical center was broken into individual words and compared with the words in six medical vocabularies, a common word list, and a database of patient names. Unrecognized words were assessed for algorithmic and contextual approaches to identifying more words, while the remainder were analyzed for spelling correctness.
RESULTS: About 60% of the words occurred in the medical vocabularies, common word list, or names database. Of the remainder, one-third were recognizable by other means. Of the remaining unrecognizable words, over three-fourths represented correctly spelled real words and the rest were misspellings.
CONCLUSIONS: Large-scale generalized natural language processing methods for the medical record will require expansion of existing vocabularies, spelling error correction, and other algorithmic approaches to map words into those from clinical vocabularies.

Entities:  

Mesh:

Year:  1997        PMID: 9357692      PMCID: PMC2233467     

Source DB:  PubMed          Journal:  Proc AMIA Annu Fall Symp        ISSN: 1091-8280


  8 in total

1.  Planned NLM/AHCPR large-scale vocabulary test: using UMLS technology to determine the extent to which controlled vocabularies cover terminology needed for health care and public health.

Authors:  B L Humphreys; W T Hole; A T McCray; J M Fitzmaurice
Journal:  J Am Med Inform Assoc       Date:  1996 Jul-Aug       Impact factor: 4.497

Review 2.  Outcome analysis: considerations for an electronic health record.

Authors:  R H Dolin
Journal:  MD Comput       Date:  1997 Jan-Feb

3.  Knowledge-based approaches to the maintenance of a large controlled medical terminology.

Authors:  J J Cimino; P D Clayton; G Hripcsak; S B Johnson
Journal:  J Am Med Inform Assoc       Date:  1994 Jan-Feb       Impact factor: 4.497

Review 4.  Natural language processing and the representation of clinical data.

Authors:  N Sager; M Lyman; C Bucknall; N Nhan; L J Tick
Journal:  J Am Med Inform Assoc       Date:  1994 Mar-Apr       Impact factor: 4.497

5.  A general natural-language text processor for clinical radiology.

Authors:  C Friedman; P O Alderson; J H Austin; J J Cimino; S B Johnson
Journal:  J Am Med Inform Assoc       Date:  1994 Mar-Apr       Impact factor: 4.497

6.  UMLS knowledge for biomedical language processing.

Authors:  A T McCray; A R Aronson; A C Browne; T C Rindflesch; A Razi; S Srinivasan
Journal:  Bull Med Libr Assoc       Date:  1993-04

7.  The Unified Medical Language System.

Authors:  D A Lindberg; B L Humphreys; A T McCray
Journal:  Methods Inf Med       Date:  1993-08       Impact factor: 2.176

8.  Unlocking clinical data from narrative reports: a study of natural language processing.

Authors:  G Hripcsak; C Friedman; P O Alderson; W DuMouchel; S B Johnson; P D Clayton
Journal:  Ann Intern Med       Date:  1995-05-01       Impact factor: 25.391

  8 in total
  4 in total

1.  Identification of misspelled words without a comprehensive dictionary using prevalence analysis.

Authors:  Alexander Turchin; Julia T Chu; Maria Shubina; Jonathan S Einbinder
Journal:  AMIA Annu Symp Proc       Date:  2007-10-11

2.  A UMLS-based spell checker for natural language processing in vaccine safety.

Authors:  Herman D Tolentino; Michael D Matters; Wikke Walop; Barbara Law; Wesley Tong; Fang Liu; Paul Fontelo; Katrin Kohl; Daniel C Payne
Journal:  BMC Med Inform Decis Mak       Date:  2007-02-12       Impact factor: 2.796

3.  FlexiTerm: a flexible term recognition method.

Authors:  Irena Spasić; Mark Greenwood; Alun Preece; Nick Francis; Glyn Elwyn
Journal:  J Biomed Semantics       Date:  2013-10-10

4.  KneeTex: an ontology-driven system for information extraction from MRI reports.

Authors:  Irena Spasić; Bo Zhao; Christopher B Jones; Kate Button
Journal:  J Biomed Semantics       Date:  2015-09-07
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.