Literature DB >> 25570216

An empiric weight computation for record linkage using linearly combined fields' similarity scores.

Xinran Li, Aline Guttmann, Jacques Demongeot, Jean-Yves Boire, Lemlih Ouchchane.   

Abstract

Record linkage is the task of identifying which records from one or more data sources refer to the same entity. Many record linkage methods were introduced and applied over the last decades. In general, the principle is to compare a range of available identifier fields in record pairs among different data sources, in order to make a linkage decision. The Fellegi-Sunter probabilistic record linkage (PRL-FS) is one of the most commonly used methods. To obtain a better performance, Winkler proposed an enhanced PRL-FS method (PRL-W) that takes into account field similarity, but its implementation requires the estimation of much more parameters which complicates the task. Consequently, we propose to develop a method that contains the best features in the PRL-FS and the PRL-W methods: simplicity of parameters estimation and consideration of fields' similarities. We hypothesize that our record linkage method outperforms the PRL-FS, and can achieve a similar performance of the PRL-W. This paper presents briefly the PRL-FS and PRL-W methods, and describes in details how to combine fields' similarity scores to create a novel record pair weight. Simulated data sets were used to assess and to compare these three methods regarding their ability to reduce the rates of false matches and false non-matches.

Mesh:

Year:  2014        PMID: 25570216     DOI: 10.1109/EMBC.2014.6943848

Source DB:  PubMed          Journal:  Conf Proc IEEE Eng Med Biol Soc        ISSN: 1557-170X


  1 in total

1.  Do GPs know their patients with cancer? Assessing the quality of cancer registration in Dutch primary care: a cross-sectional validation study.

Authors:  Annet Sollie; Jessika Roskam; Rolf H Sijmons; Mattijs E Numans; Charles W Helsper
Journal:  BMJ Open       Date:  2016-09-15       Impact factor: 2.692

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.