Literature DB >> 33987667

Deep-learning-based automated terminology mapping in OMOP-CDM.

Byungkon Kang1, Jisang Yoon2, Ha Young Kim2, Sung Jin Jo3, Yourim Lee4, Hye Jin Kam5.   

Abstract

OBJECTIVE: Accessing medical data from multiple institutions is difficult owing to the interinstitutional diversity of vocabularies. Standardization schemes, such as the common data model, have been proposed as solutions to this problem, but such schemes require expensive human supervision. This study aims to construct a trainable system that can automate the process of semantic interinstitutional code mapping.
MATERIALS AND METHODS: To automate mapping between source and target codes, we compute the embedding-based semantic similarity between corresponding descriptive sentences. We also implement a systematic approach for preparing training data for similarity computation. Experimental results are compared to traditional word-based mappings.
RESULTS: The proposed model is compared against the state-of-the-art automated matching system, which is called Usagi, of the Observational Medical Outcomes Partnership common data model. By incorporating multiple negative training samples per positive sample, our semantic matching method significantly outperforms Usagi. Its matching accuracy is at least 10% greater than that of Usagi, and this trend is consistent across various top-k measurements. DISCUSSION: The proposed deep learning-based mapping approach outperforms previous simple word-level matching algorithms because it can account for contextual and semantic information. Additionally, we demonstrate that the manner in which negative training samples are selected significantly affects the overall performance of the system.
CONCLUSION: Incorporating the semantics of code descriptions more significantly increases matching accuracy compared to traditional text co-occurrence-based approaches. The negative training sample collection methodology is also an important component of the proposed trainable system that can be adopted in both present and future related systems.
© The Author(s) 2021. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  automated mapping; common data model; deep-learning; embedding; terminology mapping

Mesh:

Year:  2021        PMID: 33987667      PMCID: PMC8279781          DOI: 10.1093/jamia/ocab030

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  13 in total

1.  Converting clinical document architecture documents to the common data model for incorporating health information exchange data in observational health studies: CDA to CDM.

Authors:  Hyerim Ji; Seok Kim; Soyoung Yi; Hee Hwang; Jeong-Whun Kim; Sooyoung Yoo
Journal:  J Biomed Inform       Date:  2020-05-26       Impact factor: 6.317

2.  Long short-term memory.

Authors:  S Hochreiter; J Schmidhuber
Journal:  Neural Comput       Date:  1997-11-15       Impact factor: 2.026

3.  Web services for data warehouses: OMOP and PCORnet on i2b2.

Authors:  Jeffrey G Klann; Lori C Phillips; Christopher Herrick; Matthew A H Joss; Kavishwar B Wagholikar; Shawn N Murphy
Journal:  J Am Med Inform Assoc       Date:  2018-10-01       Impact factor: 4.497

4.  Evaluating common data models for use with a longitudinal community registry.

Authors:  Maryam Garza; Guilherme Del Fiol; Jessica Tenenbaum; Anita Walden; Meredith Nahm Zozus
Journal:  J Biomed Inform       Date:  2016-10-29       Impact factor: 6.317

Review 5.  A review of medical terminology standards and structured reporting.

Authors:  Abdullah Awaysheh; Jeffrey Wilcke; François Elvinger; Loren Rees; Weiguo Fan; Kurt Zimmerman
Journal:  J Vet Diagn Invest       Date:  2017-10-15       Impact factor: 1.279

6.  Incrementally Transforming Electronic Medical Records into the Observational Medical Outcomes Partnership Common Data Model: A Multidimensional Quality Assurance Approach.

Authors:  Kristine E Lynch; Stephen A Deppen; Scott L DuVall; Benjamin Viernes; Aize Cao; Daniel Park; Elizabeth Hanchrow; Kushan Hewa; Peter Greaves; Michael E Matheny
Journal:  Appl Clin Inform       Date:  2019-10-23       Impact factor: 2.342

7.  Transforming French Electronic Health Records into the Observational Medical Outcome Partnership's Common Data Model: A Feasibility Study.

Authors:  Antoine Lamer; Nicolas Depas; Matthieu Doutreligne; Adrien Parrot; David Verloop; Marguerite-Marie Defebvre; Grégoire Ficheur; Emmanuel Chazard; Jean-Baptiste Beuscart
Journal:  Appl Clin Inform       Date:  2020-01-08       Impact factor: 2.342

8.  Data model harmonization for the All Of Us Research Program: Transforming i2b2 data into the OMOP common data model.

Authors:  Jeffrey G Klann; Matthew A H Joss; Kevin Embree; Shawn N Murphy
Journal:  PLoS One       Date:  2019-02-19       Impact factor: 3.240

9.  SHRINE: enabling nationally scalable multi-site disease studies.

Authors:  Andrew J McMurry; Shawn N Murphy; Douglas MacFadden; Griffin Weber; William W Simons; John Orechia; Jonathan Bickel; Nich Wattanasin; Clint Gilbert; Philip Trevvett; Susanne Churchill; Isaac S Kohane
Journal:  PLoS One       Date:  2013-03-07       Impact factor: 3.240

10.  Can We Rely on Results From IQVIA Medical Research Data UK Converted to the Observational Medical Outcome Partnership Common Data Model?: A Validation Study Based on Prescribing Codeine in Children.

Authors:  Gianmario Candore; Karin Hedenmalm; Jim Slattery; Alison Cave; Xavier Kurz; Peter Arlett
Journal:  Clin Pharmacol Ther       Date:  2020-01-19       Impact factor: 6.875

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.