Literature DB >> 26527579

A normalization method for combination of laboratory test results from different electronic healthcare databases in a distributed research network.

Dukyong Yoon1,2, Martijn J Schuemie2,3, Ju Han Kim4, Dong Ki Kim5, Man Young Park6, Eun Kyoung Ahn1, Eun-Young Jung7, Dong Kyun Park7, Soo Yeon Cho1, Dahye Shin1, Yeonsoo Hwang8, Rae Woong Park1,2.   

Abstract

PURPOSE: Distributed research networks (DRNs) afford statistical power by integrating observational data from multiple partners for retrospective studies. However, laboratory test results across care sites are derived using different assays from varying patient populations, making it difficult to simply combine data for analysis. Additionally, existing normalization methods are not suitable for retrospective studies. We normalized laboratory results from different data sources by adjusting for heterogeneous clinico-epidemiologic characteristics of the data and called this the subgroup-adjusted normalization (SAN) method.
METHODS: Subgroup-adjusted normalization renders the means and standard deviations of distributions identical under population structure-adjusted conditions. To evaluate its performance, we compared SAN with existing methods for simulated and real datasets consisting of blood urea nitrogen, serum creatinine, hematocrit, hemoglobin, serum potassium, and total bilirubin. Various clinico-epidemiologic characteristics can be applied together in SAN. For simplicity of comparison, age and gender were used to adjust population heterogeneity in this study.
RESULTS: In simulations, SAN had the lowest standardized difference in means (SDM) and Kolmogorov-Smirnov values for all tests (p < 0.05). In a real dataset, SAN had the lowest SDM and Kolmogorov-Smirnov values for blood urea nitrogen, hematocrit, hemoglobin, and serum potassium, and the lowest SDM for serum creatinine (p < 0.05).
CONCLUSION: Subgroup-adjusted normalization performed better than normalization using other methods. The SAN method is applicable in a DRN environment and should facilitate analysis of data integrated across DRN partners for retrospective observational studies.
Copyright © 2015 John Wiley & Sons, Ltd.

Entities:  

Keywords:  distributed research networks; electronic health records; laboratory test; normalization; pharmacoepidemiology

Mesh:

Year:  2015        PMID: 26527579     DOI: 10.1002/pds.3893

Source DB:  PubMed          Journal:  Pharmacoepidemiol Drug Saf        ISSN: 1053-8569            Impact factor:   2.890


  2 in total

Review 1.  Clinical Research Informatics: Supporting the Research Study Lifecycle.

Authors:  S B Johnson
Journal:  Yearb Med Inform       Date:  2017-09-11

2.  Machine learning model combining features from algorithms with different analytical methodologies to detect laboratory-event-related adverse drug reaction signals.

Authors:  Eugene Jeong; Namgi Park; Young Choi; Rae Woong Park; Dukyong Yoon
Journal:  PLoS One       Date:  2018-11-21       Impact factor: 3.240

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.