Literature DB >> 29298592

Linking temporal medical records using non-protected health information data.

Luca Bonomi1, Xiaoqian Jiang1.   

Abstract

Modern medical research relies on multi-institutional collaborations which enhance the knowledge discovery and data reuse. While these collaborations allow researchers to perform analytics otherwise impossible on individual datasets, they often pose significant challenges in the data integration process. Due to the lack of a unique identifier, data integration solutions often have to rely on patient's protected health information (PHI). In many situations, such information cannot leave the institutions or must be strictly protected. Furthermore, the presence of noisy values for these attributes may result in poor overall utility. While much research has been done to address these challenges, most of the current solutions are designed for a static setting without considering the temporal information of the data (e.g. EHR). In this work, we propose a novel approach that uses non-PHI for linking patient longitudinal data. Specifically, our technique captures the diagnosis dependencies using patterns which are shown to provide important indications for linking patient records. Our solution can be used as a standalone technique to perform temporal record linkage using non-protected health information data or it can be combined with Privacy Preserving Record Linkage solutions (PPRL) when protected health information is available. In this case, our approach can solve ambiguities in results. Experimental evaluations on real datasets demonstrate the effectiveness of our technique.

Entities:  

Keywords:  EHR data; Record linkage; data mining; sequential patterns; temporal data

Mesh:

Year:  2017        PMID: 29298592      PMCID: PMC5758434          DOI: 10.1177/0962280217698005

Source DB:  PubMed          Journal:  Stat Methods Med Res        ISSN: 0962-2802            Impact factor:   3.021


  14 in total

Review 1.  Machine learning for medical diagnosis: history, state of the art and perspective.

Authors:  I Kononenko
Journal:  Artif Intell Med       Date:  2001-08       Impact factor: 5.326

2.  Validation of a common data model for active safety surveillance research.

Authors:  J Marc Overhage; Patrick B Ryan; Christian G Reich; Abraham G Hartzema; Paul E Stang
Journal:  J Am Med Inform Assoc       Date:  2011-10-28       Impact factor: 4.497

3.  Efficient q-gram filters for finding all epsilon-matches over a given length.

Authors:  Kim R Rasmussen; Jens Stoye; Eugene W Myers
Journal:  J Comput Biol       Date:  2006-03       Impact factor: 1.479

4.  A framework for mining signatures from event sequences and its applications in healthcare data.

Authors:  Fei Wang; Noah Lee; Jianying Hu; Jimeng Sun; Shahram Ebadollahi; Andrew F Laine
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2013-02       Impact factor: 6.226

Review 5.  Health information systems - past, present, future.

Authors:  Reinhold Haux
Journal:  Int J Med Inform       Date:  2005-09-19       Impact factor: 4.046

Review 6.  Mining electronic health records: towards better research applications and clinical care.

Authors:  Peter B Jensen; Lars J Jensen; Søren Brunak
Journal:  Nat Rev Genet       Date:  2012-05-02       Impact factor: 53.242

7.  Privacy-preserving record linkage using Bloom filters.

Authors:  Rainer Schnell; Tobias Bachteler; Jörg Reiher
Journal:  BMC Med Inform Decis Mak       Date:  2009-08-25       Impact factor: 2.796

8.  Next-generation phenotyping of electronic health records.

Authors:  George Hripcsak; David J Albers
Journal:  J Am Med Inform Assoc       Date:  2012-09-06       Impact factor: 4.497

9.  Heart failure in the diabetic population - pathophysiology, diagnosis and management.

Authors:  Jacek Kasznicki; Jozef Drzewoski
Journal:  Arch Med Sci       Date:  2014-06-27       Impact factor: 3.318

10.  MIMIC-III, a freely accessible critical care database.

Authors:  Alistair E W Johnson; Tom J Pollard; Lu Shen; Li-Wei H Lehman; Mengling Feng; Mohammad Ghassemi; Benjamin Moody; Peter Szolovits; Leo Anthony Celi; Roger G Mark
Journal:  Sci Data       Date:  2016-05-24       Impact factor: 6.444

View more
  4 in total

1.  Privacy-Preserving Methods for Vertically Partitioned Incomplete Data.

Authors:  Yi Deng; Xiaoqian Jiang; Qi Long
Journal:  AMIA Annu Symp Proc       Date:  2021-01-25

2.  Patient ranking with temporally annotated data.

Authors:  Luca Bonomi; Xiaoqian Jiang
Journal:  J Biomed Inform       Date:  2017-12-19       Impact factor: 6.317

3.  Federated learning algorithms for generalized mixed-effects model (GLMM) on horizontally partitioned data from distributed sources.

Authors:  Wentao Li; Jiayi Tong; Md Monowar Anjum; Noman Mohammed; Yong Chen; Xiaoqian Jiang
Journal:  BMC Med Inform Decis Mak       Date:  2022-10-16       Impact factor: 3.298

4.  Conceptual Design, Implementation, and Evaluation of Generic and Standard-Compliant Data Transfer into Electronic Health Records.

Authors:  Rogério Blitz; Martin Dugas
Journal:  Appl Clin Inform       Date:  2020-05-27       Impact factor: 2.342

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.