Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deep-learning-based automated terminology mapping in OMOP-CDM.

Literature DB >> 33987667

Deep-learning-based automated terminology mapping in OMOP-CDM.

Byungkon Kang¹, Jisang Yoon², Ha Young Kim², Sung Jin Jo³, Yourim Lee⁴, Hye Jin Kam⁵.

Abstract

OBJECTIVE: Accessing medical data from multiple institutions is difficult owing to the interinstitutional diversity of vocabularies. Standardization schemes, such as the common data model, have been proposed as solutions to this problem, but such schemes require expensive human supervision. This study aims to construct a trainable system that can automate the process of semantic interinstitutional code mapping.
MATERIALS AND METHODS: To automate mapping between source and target codes, we compute the embedding-based semantic similarity between corresponding descriptive sentences. We also implement a systematic approach for preparing training data for similarity computation. Experimental results are compared to traditional word-based mappings.
RESULTS: The proposed model is compared against the state-of-the-art automated matching system, which is called Usagi, of the Observational Medical Outcomes Partnership common data model. By incorporating multiple negative training samples per positive sample, our semantic matching method significantly outperforms Usagi. Its matching accuracy is at least 10% greater than that of Usagi, and this trend is consistent across various top-k measurements. DISCUSSION: The proposed deep learning-based mapping approach outperforms previous simple word-level matching algorithms because it can account for contextual and semantic information. Additionally, we demonstrate that the manner in which negative training samples are selected significantly affects the overall performance of the system.
CONCLUSION: Incorporating the semantics of code descriptions more significantly increases matching accuracy compared to traditional text co-occurrence-based approaches. The negative training sample collection methodology is also an important component of the proposed trainable system that can be adopted in both present and future related systems.

Entities: Chemical

Keywords: automated mapping; common data model; deep-learning; embedding; terminology mapping

Mesh：

Year: 2021 PMID： 33987667 PMCID： PMC8279781 DOI： 10.1093/jamia/ocab030

Source DB: PubMed Journal: J Am Med Inform Assoc ISSN： 1067-5027 Impact factor: 4.497

Keyword Cloud
References

13 in total

1. Converting clinical document architecture documents to the common data model for incorporating health information exchange data in observational health studies: CDA to CDM.

Authors: Hyerim Ji; Seok Kim; Soyoung Yi; Hee Hwang; Jeong-Whun Kim; Sooyoung Yoo
Journal: J Biomed Inform Date: 2020-05-26 Impact factor: 6.317

2. Long short-term memory.

Authors: S Hochreiter; J Schmidhuber
Journal: Neural Comput Date: 1997-11-15 Impact factor: 2.026

3. Web services for data warehouses: OMOP and PCORnet on i2b2.

Authors: Jeffrey G Klann; Lori C Phillips; Christopher Herrick; Matthew A H Joss; Kavishwar B Wagholikar; Shawn N Murphy
Journal: J Am Med Inform Assoc Date: 2018-10-01 Impact factor: 4.497

4. Evaluating common data models for use with a longitudinal community registry.

Authors: Maryam Garza; Guilherme Del Fiol; Jessica Tenenbaum; Anita Walden; Meredith Nahm Zozus
Journal: J Biomed Inform Date: 2016-10-29 Impact factor: 6.317

Review 5. A review of medical terminology standards and structured reporting.

Authors: Abdullah Awaysheh; Jeffrey Wilcke; François Elvinger; Loren Rees; Weiguo Fan; Kurt Zimmerman
Journal: J Vet Diagn Invest Date: 2017-10-15 Impact factor: 1.279

6. Incrementally Transforming Electronic Medical Records into the Observational Medical Outcomes Partnership Common Data Model: A Multidimensional Quality Assurance Approach.

Authors: Kristine E Lynch; Stephen A Deppen; Scott L DuVall; Benjamin Viernes; Aize Cao; Daniel Park; Elizabeth Hanchrow; Kushan Hewa; Peter Greaves; Michael E Matheny
Journal: Appl Clin Inform Date: 2019-10-23 Impact factor: 2.342

7. Transforming French Electronic Health Records into the Observational Medical Outcome Partnership's Common Data Model: A Feasibility Study.

Authors: Antoine Lamer; Nicolas Depas; Matthieu Doutreligne; Adrien Parrot; David Verloop; Marguerite-Marie Defebvre; Grégoire Ficheur; Emmanuel Chazard; Jean-Baptiste Beuscart
Journal: Appl Clin Inform Date: 2020-01-08 Impact factor: 2.342

8. Data model harmonization for the All Of Us Research Program: Transforming i2b2 data into the OMOP common data model.

Authors: Jeffrey G Klann; Matthew A H Joss; Kevin Embree; Shawn N Murphy
Journal: PLoS One Date: 2019-02-19 Impact factor: 3.240

9. SHRINE: enabling nationally scalable multi-site disease studies.

Authors: Andrew J McMurry; Shawn N Murphy; Douglas MacFadden; Griffin Weber; William W Simons; John Orechia; Jonathan Bickel; Nich Wattanasin; Clint Gilbert; Philip Trevvett; Susanne Churchill; Isaac S Kohane
Journal: PLoS One Date: 2013-03-07 Impact factor: 3.240

10. Can We Rely on Results From IQVIA Medical Research Data UK Converted to the Observational Medical Outcome Partnership Common Data Model?: A Validation Study Based on Prescribing Codeine in Children.

Authors: Gianmario Candore; Karin Hedenmalm; Jim Slattery; Alison Cave; Xavier Kurz; Peter Arlett
Journal: Clin Pharmacol Ther Date: 2020-01-19 Impact factor: 6.875