Literature DB >> 24316051

Development and evaluation of RapTAT: a machine learning system for concept mapping of phrases from medical narratives.

Glenn T Gobbel1, Ruth Reeves2, Shrimalini Jayaramaraja3, Dario Giuse4, Theodore Speroff5, Steven H Brown6, Peter L Elkin7, Michael E Matheny8.   

Abstract

Rapid, automated determination of the mapping of free text phrases to pre-defined concepts could assist in the annotation of clinical notes and increase the speed of natural language processing systems. The aim of this study was to design and evaluate a token-order-specific naïve Bayes-based machine learning system (RapTAT) to predict associations between phrases and concepts. Performance was assessed using a reference standard generated from 2860 VA discharge summaries containing 567,520 phrases that had been mapped to 12,056 distinct Systematized Nomenclature of Medicine - Clinical Terms (SNOMED CT) concepts by the MCVS natural language processing system. It was also assessed on the manually annotated, 2010 i2b2 challenge data. Performance was established with regard to precision, recall, and F-measure for each of the concepts within the VA documents using bootstrapping. Within that corpus, concepts identified by MCVS were broadly distributed throughout SNOMED CT, and the token-order-specific language model achieved better performance based on precision, recall, and F-measure (0.95±0.15, 0.96±0.16, and 0.95±0.16, respectively; mean±SD) than the bag-of-words based, naïve Bayes model (0.64±0.45, 0.61±0.46, and 0.60±0.45, respectively) that has previously been used for concept mapping. Precision, recall, and F-measure on the i2b2 test set were 92.9%, 85.9%, and 89.2% respectively, using the token-order-specific model. RapTAT required just 7.2ms to map all phrases within a single discharge summary, and mapping rate did not decrease as the number of processed documents increased. The high performance attained by the tool in terms of both accuracy and speed was encouraging, and the mapping rate should be sufficient to support near-real-time, interactive annotation of medical narratives. These results demonstrate the feasibility of rapidly and accurately mapping phrases to a wide range of medical concepts based on a token-order-specific naïve Bayes model and machine learning. Published by Elsevier Inc.

Keywords:  Bayesian prediction; CSV; FN; FP; IQV; MCVS; Machine learning; Multi-threaded Clinical Vocabulary Server; NLP; Natural language processing; Opt; Perf; RapTAT; Rapid Text Annotation Tool; SNOMED-CT; SVM; Systematized Nomenclature of Medicine-Clinical Terms; Systematized nomenclature of medicine; TP; UMLS; Unified Medical Language System; comma-separated value; false negative; false positive; index of qualitative variation; natural language processing; optimism; performance; support vector machine; true positive

Mesh:

Year:  2013        PMID: 24316051     DOI: 10.1016/j.jbi.2013.11.008

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  12 in total

1.  Interpretable Topic Features for Post-ICU Mortality Prediction.

Authors:  Yen-Fu Luo; Anna Rumshisky
Journal:  AMIA Annu Symp Proc       Date:  2017-02-10

2.  Assisted annotation of medical free text using RapTAT.

Authors:  Glenn T Gobbel; Jennifer Garvin; Ruth Reeves; Robert M Cronin; Julia Heavirland; Jenifer Williams; Allison Weaver; Shrimalini Jayaramaraja; Dario Giuse; Theodore Speroff; Steven H Brown; Hua Xu; Michael E Matheny
Journal:  J Am Med Inform Assoc       Date:  2014-01-15       Impact factor: 4.497

3.  Dense Annotation of Free-Text Critical Care Discharge Summaries from an Indian Hospital and Associated Performance of a Clinical NLP Annotator.

Authors:  S V Ramanan; Kedar Radhakrishna; Abijeet Waghmare; Tony Raj; Senthil P Nathan; Sai Madhukar Sreerama; Sriram Sampath
Journal:  J Med Syst       Date:  2016-06-24       Impact factor: 4.460

4.  Interactive NLP in Clinical Care: Identifying Incidental Findings in Radiology Reports.

Authors:  Gaurav Trivedi; Esmaeel R Dadashzadeh; Robert M Handzel; Wendy W Chapman; Shyam Visweswaran; Harry Hochheiser
Journal:  Appl Clin Inform       Date:  2019-09-04       Impact factor: 2.342

Review 5.  Capturing the Patient's Perspective: a Review of Advances in Natural Language Processing of Health-Related Text.

Authors:  G Gonzalez-Hernandez; A Sarker; K O'Connor; G Savova
Journal:  Yearb Med Inform       Date:  2017-09-11

6.  Congestive heart failure information extraction framework for automated treatment performance measures assessment.

Authors:  Stéphane M Meystre; Youngjun Kim; Glenn T Gobbel; Michael E Matheny; Andrew Redd; Bruce E Bray; Jennifer H Garvin
Journal:  J Am Med Inform Assoc       Date:  2017-04-01       Impact factor: 4.497

7.  Translational NLP: A New Paradigm and General Principles for Natural Language Processing Research.

Authors:  Denis Newman-Griffis; Jill Fain Lehman; Carolyn Rosé; Harry Hochheiser
Journal:  Proc Conf       Date:  2021-06

8.  Identifying named entities from PubMed for enriching semantic categories.

Authors:  Sun Kim; Zhiyong Lu; W John Wilbur
Journal:  BMC Bioinformatics       Date:  2015-02-21       Impact factor: 3.169

9.  Strategies for improving physician documentation in the emergency department: a systematic review.

Authors:  Diane L Lorenzetti; Hude Quan; Kelsey Lucyk; Ceara Cunningham; Deirdre Hennessy; Jason Jiang; Cynthia A Beck
Journal:  BMC Emerg Med       Date:  2018-10-25

10.  Automating Quality Measures for Heart Failure Using Natural Language Processing: A Descriptive Study in the Department of Veterans Affairs.

Authors:  Jennifer Hornung Garvin; Youngjun Kim; Glenn Temple Gobbel; Michael E Matheny; Andrew Redd; Bruce E Bray; Paul Heidenreich; Dan Bolton; Julia Heavirland; Natalie Kelly; Ruth Reeves; Megha Kalsy; Mary Kane Goldstein; Stephane M Meystre
Journal:  JMIR Med Inform       Date:  2018-01-15
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.