Literature DB >> 23304329

Towards a semantic lexicon for clinical natural language processing.

Hongfang Liu1, Stephen T Wu, Dingcheng Li, Siddhartha Jonnalagadda, Sunghwan Sohn, Kavishwar Wagholikar, Peter J Haug, Stanley M Huff, Christopher G Chute.   

Abstract

A semantic lexicon which associates words and phrases in text to concepts is critical for extracting and encoding clinical information in free text and therefore achieving semantic interoperability between structured and unstructured data in Electronic Health Records (EHRs). Directly using existing standard terminologies may have limited coverage with respect to concepts and their corresponding mentions in text. In this paper, we analyze how tokens and phrases in a large corpus distribute and how well the UMLS captures the semantics. A corpus-driven semantic lexicon, MedLex, has been constructed where the semantics is based on the UMLS assisted with variants mined and usage information gathered from clinical text. The detailed corpus analysis of tokens, chunks, and concept mentions shows the UMLS is an invaluable source for natural language processing. Increasing the semantic coverage of tokens provides a good foundation in capturing clinical information comprehensively. The study also yields some insights in developing practical NLP systems.

Mesh:

Year:  2012        PMID: 23304329      PMCID: PMC3540492     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  20 in total

1.  A method for vocabulary development and visualization based on medical language processing and XML.

Authors:  H Liu; C Friedman
Journal:  Proc AMIA Symp       Date:  2000

2.  Evaluating the UMLS as a source of lexical knowledge for medical language processing.

Authors:  C Friedman; H Liu; L Shagina; S Johnson; G Hripcsak
Journal:  Proc AMIA Symp       Date:  2001

3.  A study of abbreviations in MEDLINE abstracts.

Authors:  Hongfang Liu; Alan R Aronson; Carol Friedman
Journal:  Proc AMIA Symp       Date:  2002

4.  Exploring semantic groups through visual approaches.

Authors:  Olivier Bodenreider; Alexa T McCray
Journal:  J Biomed Inform       Date:  2003-12       Impact factor: 6.317

5.  The Unified Medical Language System (UMLS): integrating biomedical terminology.

Authors:  Olivier Bodenreider
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

6.  An overview of MetaMap: historical perspective and recent advances.

Authors:  Alan R Aronson; François-Michel Lang
Journal:  J Am Med Inform Assoc       Date:  2010 May-Jun       Impact factor: 4.497

7.  Enhancing acronym/abbreviation knowledge bases with semantic information.

Authors:  Manabu Torii; Hongfang Liu
Journal:  AMIA Annu Symp Proc       Date:  2007-10-11

8.  Using machine learning for concept extraction on clinical documents from multiple data sources.

Authors:  Manabu Torii; Kavishwar Wagholikar; Hongfang Liu
Journal:  J Am Med Inform Assoc       Date:  2011-06-27       Impact factor: 4.497

9.  Unified Medical Language System term occurrences in clinical notes: a large-scale corpus analysis.

Authors:  Stephen T Wu; Hongfang Liu; Dingcheng Li; Cui Tao; Mark A Musen; Christopher G Chute; Nigam H Shah
Journal:  J Am Med Inform Assoc       Date:  2012-04-04       Impact factor: 4.497

10.  Overview of BioCreative II gene mention recognition.

Authors:  Larry Smith; Lorraine K Tanabe; Rie Johnson nee Ando; Cheng-Ju Kuo; I-Fang Chung; Chun-Nan Hsu; Yu-Shi Lin; Roman Klinger; Christoph M Friedrich; Kuzman Ganchev; Manabu Torii; Hongfang Liu; Barry Haddow; Craig A Struble; Richard J Povinelli; Andreas Vlachos; William A Baumgartner; Lawrence Hunter; Bob Carpenter; Richard Tzong-Han Tsai; Hong-Jie Dai; Feng Liu; Yifei Chen; Chengjie Sun; Sophia Katrenko; Pieter Adriaans; Christian Blaschke; Rafael Torres; Mariana Neves; Preslav Nakov; Anna Divoli; Manuel Maña-López; Jacinto Mata; W John Wilbur
Journal:  Genome Biol       Date:  2008-09-01       Impact factor: 13.583

View more
  10 in total

1.  The Sublanguage of Clinical Problem Lists: A Corpus Analysis.

Authors:  Kevin J Peterson; Hongfang Liu
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

2.  Identifying Peripheral Arterial Disease Cases Using Natural Language Processing of Clinical Notes.

Authors:  Naveed Afzal; Sunghwan Sohn; Sara Abram; Hongfang Liu; Iftikhar J Kullo; Adelaide M Arruda-Olson
Journal:  IEEE EMBS Int Conf Biomed Health Inform       Date:  2016-04-21

3.  Evaluation of an Automated Information Extraction Tool for Imaging Data Elements to Populate a Breast Cancer Screening Registry.

Authors:  Ronilda Lacson; Kimberly Harris; Phyllis Brawarsky; Tor D Tosteson; Tracy Onega; Anna N A Tosteson; Abby Kaye; Irina Gonzalez; Robyn Birdwell; Jennifer S Haas
Journal:  J Digit Imaging       Date:  2015-10       Impact factor: 4.056

4.  Generalized Extraction and Classification of Span-Level Clinical Phrases.

Authors:  Tyler Baldwin; Yufan Guo; Vandana V Mukherjee; Tanveer Syeda-Mahmood
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

5.  Facilitating post-surgical complication detection through sublanguage analysis.

Authors:  Hongfang Liu; Sunghwan Sohn; Sean Murphy; Jenna Lovely; Matthew Burton; James Naessens; David W Larson
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2014-04-07

Review 6.  Natural language processing systems for capturing and standardizing unstructured clinical information: A systematic review.

Authors:  Kory Kreimeyer; Matthew Foster; Abhishek Pandey; Nina Arya; Gwendolyn Halford; Sandra F Jones; Richard Forshee; Mark Walderhaug; Taxiarchis Botsis
Journal:  J Biomed Inform       Date:  2017-07-17       Impact factor: 6.317

7.  Clinical documentation variations and NLP system portability: a case study in asthma birth cohorts across institutions.

Authors:  Sunghwan Sohn; Yanshan Wang; Chung-Il Wi; Elizabeth A Krusemark; Euijung Ryu; Mir H Ali; Young J Juhn; Hongfang Liu
Journal:  J Am Med Inform Assoc       Date:  2018-03-01       Impact factor: 4.497

8.  Automated SNOMED CT concept and attribute relationship detection through a web-based implementation of cTAKES.

Authors:  Martijn G Kersloot; Francis Lau; Ameen Abu-Hanna; Derk L Arts; Ronald Cornet
Journal:  J Biomed Semantics       Date:  2019-09-18

9.  Evaluation of a Concept Mapping Task Using Named Entity Recognition and Normalization in Unstructured Clinical Text.

Authors:  Sapna Trivedi; Roger Gildersleeve; Sandra Franco; Andrew S Kanter; Afzal Chaudhry
Journal:  J Healthc Inform Res       Date:  2020-10-16

10.  Multicenter Validation of Natural Language Processing Algorithms for the Detection of Common Data Elements in Operative Notes for Total Hip Arthroplasty: Algorithm Development and Validation.

Authors:  Peijin Han; Sunyang Fu; Julie Kolis; Richard Hughes; Brian R Hallstrom; Martha Carvour; Hilal Maradit-Kremers; Sunghwan Sohn; V G Vinod Vydiswaran
Journal:  JMIR Med Inform       Date:  2022-08-31
  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.