Literature DB >> 9391936

Evaluating the coverage of controlled health data terminologies: report on the results of the NLM/AHCPR large scale vocabulary test.

B L Humphreys1, A T McCray, M L Cheh.   

Abstract

OBJECTIVE: To determine the extent to which a combination of existing machine-readable health terminologies cover the concepts and terms needed for a comprehensive controlled vocabulary for health information systems by carrying out a distributed national experiment using the Internet and the UMLS Knowledge Sources, lexical programs, and server.
METHODS: Using a specially designed Web-based interface to the UMLS Knowledge Source Server, participants searched the more than 30 vocabularies in the 1996 UMLS Metathesaurus and three planned additions to determine if concepts for which they desired controlled terminology were present or absent. For each term submitted, the interface presented a candidate exact match or a set of potential approximate matches from which the participant selected the most closely related concept. The interface captured a profile of the terms submitted by the participant and for each term searched, information about the concept (if any) selected by the participant. The term information was loaded into a database at NLM for review and analysis and was also available to be downloaded by the participant. A team of subject experts reviewed records to identify matches missed by participants and to correct any obvious errors in relationships. The editors of SNOMED International and the Read Codes were given a random sample of reviewed terms for which exact meaning matches were not found to identify exact matches that were missed or any valid combinations of concepts that were synonymous to input terms. The 1997 UMLS Metathesaurus was used in the semantic type and vocabulary source analysis because it included most of the three planned additions.
RESULTS: Sixty-three participants submitted a total of 41,127 terms, which represented 32,679 normalized strings. More than 80% of the terms submitted were wanted for parts of the patient record related to the patient's condition. Following review, 58% of all submitted terms had exact meaning matches in the controlled vocabularies in the test, 41% had related concepts, and 1% were not found. Of the 28% of the terms which were narrower in meaning than a concept in the controlled vocabularies, 86% shared lexical items with the broader concept, but had additional modification. The percentage of exact meanings matches varied by specialty from 45% to 71%. Twenty-nine different vocabularies contained meanings for some of the 23,837 terms (a maximum of 12,707 discrete concepts) with exact meaning matches. Based on preliminary data and analysis, individual vocabularies contained < 1% to 63% of the terms and < 1% to 54% of the concepts. Only SNOMED International and the Read Codes had more than 60% of the terms and more than 50% of the concepts.
CONCLUSIONS: The combination of existing controlled vocabularies included in the test represents the meanings of the majority of the terminology needed to record patient conditions, providing substantially more exact matches than any individual vocabulary in the set. From a technical and organizational perspective, the test was successful and should serve as a useful model, both for distributed input to the enhancement of controlled vocabularies and for other kinds of collaborative informatics research.

Entities:  

Mesh:

Year:  1997        PMID: 9391936      PMCID: PMC61267          DOI: 10.1136/jamia.1997.0040484

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  12 in total

1.  Planned NLM/AHCPR large-scale vocabulary test: using UMLS technology to determine the extent to which controlled vocabularies cover terminology needed for health care and public health.

Authors:  B L Humphreys; W T Hole; A T McCray; J M Fitzmaurice
Journal:  J Am Med Inform Assoc       Date:  1996 Jul-Aug       Impact factor: 4.497

2.  A clinically derived terminology: qualification to reduction.

Authors:  C G Chute; P L Elkin
Journal:  Proc AMIA Annu Fall Symp       Date:  1997

3.  The UMLS Knowledge Source Server: a versatile Internet-based research tool.

Authors:  A T McCray; A M Razi; A K Bangalore; A C Browne; P Z Stavri
Journal:  Proc AMIA Annu Fall Symp       Date:  1996

4.  Read Codes Version 3: a user led terminology.

Authors:  M O'Neil; C Payne; J Read
Journal:  Methods Inf Med       Date:  1995-03       Impact factor: 2.176

5.  Call for a standard clinical vocabulary.

Authors:  W E Hammond
Journal:  J Am Med Inform Assoc       Date:  1997 May-Jun       Impact factor: 4.497

6.  The content coverage of clinical classifications. For The Computer-Based Patient Record Institute's Work Group on Codes & Structures.

Authors:  C G Chute; S P Cohn; K E Campbell; D E Oliver; J R Campbell
Journal:  J Am Med Inform Assoc       Date:  1996 May-Jun       Impact factor: 4.497

7.  Phase II evaluation of clinical coding schemes: completeness, taxonomy, mapping, definitions, and clarity. CPRI Work Group on Codes and Structures.

Authors:  J R Campbell; P Carpenter; C Sneiderman; S Cohn; C G Chute; J Warren
Journal:  J Am Med Inform Assoc       Date:  1997 May-Jun       Impact factor: 4.497

8.  Standards for medical identifiers, codes, and messages needed to create an efficient computer-stored medical record. American Medical Informatics Association.

Authors: 
Journal:  J Am Med Inform Assoc       Date:  1994 Jan-Feb       Impact factor: 4.497

9.  The measurement of observer agreement for categorical data.

Authors:  J R Landis; G G Koch
Journal:  Biometrics       Date:  1977-03       Impact factor: 2.571

10.  Lexical methods for managing variation in biomedical terminologies.

Authors:  A T McCray; S Srinivasan; A C Browne
Journal:  Proc Annu Symp Comput Appl Med Care       Date:  1994
View more
  42 in total

1.  Need a bloody nose be a nosebleed? or, lexical variants cause surprising results.

Authors:  M E Sievert; T B Patrick; J C Reid
Journal:  Bull Med Libr Assoc       Date:  2001-01

2.  Toward vocabulary domain specifications for health level 7-coded data elements.

Authors:  S Bakken; K E Campbell; J J Cimino; S M Huff; W E Hammond
Journal:  J Am Med Inform Assoc       Date:  2000 Jul-Aug       Impact factor: 4.497

3.  A national agenda for public health informatics: summarized recommendations from the 2001 AMIA Spring Congress.

Authors:  W A Yasnoff; J M Overhage; B L Humphreys; M LaVenture
Journal:  J Am Med Inform Assoc       Date:  2001 Nov-Dec       Impact factor: 4.497

4.  Issues in the design of medical ontologies used for knowledge sharing.

Authors:  A Burgun; G Botti; M Fieschi; P Le Beux
Journal:  J Med Syst       Date:  2001-04       Impact factor: 4.460

5.  Does size matter?--Evaluation of value added content of two decades of successive coding schemes in secondary care.

Authors:  P J Brown; L Odusanya
Journal:  Proc AMIA Symp       Date:  2001

6.  Coverage of oncology drug indication concepts and compositional semantics by SNOMED-CT.

Authors:  Steven H Brown; Brent A Bauer; Dietland L Wahner-Roedler; Peter L Elkin
Journal:  AMIA Annu Symp Proc       Date:  2003

7.  An applied evaluation of SNOMED CT as a clinical vocabulary for the computerized diagnosis and problem list.

Authors:  Henry Wasserman; Jerome Wang
Journal:  AMIA Annu Symp Proc       Date:  2003

8.  An evaluation of the UMLS in representing corpus derived clinical concepts.

Authors:  Jeff Friedlin; Marc Overhage
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22

9.  An overview of MetaMap: historical perspective and recent advances.

Authors:  Alan R Aronson; François-Michel Lang
Journal:  J Am Med Inform Assoc       Date:  2010 May-Jun       Impact factor: 4.497

10.  The UMLS-CORE project: a study of the problem list terminologies used in large healthcare institutions.

Authors:  Kin Wah Fung; Clement McDonald; Suresh Srinivasan
Journal:  J Am Med Inform Assoc       Date:  2010 Nov-Dec       Impact factor: 4.497

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.