Rainer Schnell, Tobias Bachteler, Jörg Reiher.
Abstract
BACKGROUND: Combining multiple databases that hold disjunctive or additional information on the same person is increasingly common in research. If unique identification numbers for these individuals are not available, probabilistic record linkage is used to identify matching record pairs. In many applications, identifiers have to be encrypted due to privacy concerns.
Year: 2009 PMID: 19706187 PMCID: PMC2753305 DOI: 10.1186/1472-6947-9-41
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Figure 1. Example of the use of two Bloom filters for the privacy-preserving computation of string similarities.
Figure 2. Comparison of precision and recall for Bloom filters with unencrypted trigrams using simulated data.
Figure 3. Comparison of precision and recall for Bloom filters with exact string comparison using simulated data.
Figure 4. Comparison of precision and recall for Bloom filters with Soundex using simulated data.
Figure 5. Comparison of precision and recall for Bloom filters with unencrypted bigrams using actual data.
Figure 6. Comparison of precision and recall for Bloom filters with a phonetic encoding using actual data.
Figure 7. Rescaled cutout of Figure 6 highlighting recall levels above 0.75.
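
The idea named in the caption of Figure 1, comparing two Bloom filters instead of two plaintext names, can be illustrated with a minimal Python sketch. This is not the authors' implementation. The filter length (1000 bits), the number of hash functions (15), the padding of names with blanks, the key `shared-secret`, and the double-hashing construction from HMAC-SHA1 and HMAC-MD5 are all assumptions chosen for illustration, as are the function names `bigrams`, `bloom_encode`, and `dice`.

```python
import hashlib
import hmac


def bigrams(name: str) -> set[str]:
    """Split a name into overlapping 2-grams, padded with one blank per side (assumption)."""
    padded = f" {name.lower()} "
    return {padded[i:i + 2] for i in range(len(padded) - 1)}


def bloom_encode(name: str, size: int = 1000, num_hashes: int = 15,
                 key: bytes = b"shared-secret") -> set[int]:
    """Return the set of bit positions set to 1 in the Bloom filter for `name`.

    All parameter defaults are illustrative assumptions, not values taken
    from the text above.
    """
    bits: set[int] = set()
    for gram in bigrams(name):
        data = gram.encode("utf-8")
        # Two keyed hashes; further hash functions are derived by double hashing.
        h1 = int.from_bytes(hmac.new(key, data, hashlib.sha1).digest(), "big")
        h2 = int.from_bytes(hmac.new(key, data, hashlib.md5).digest(), "big")
        for i in range(num_hashes):
            bits.add((h1 + i * h2) % size)
    return bits


def dice(a: set[int], b: set[int]) -> float:
    """Dice coefficient of two Bloom filters given as sets of set-bit positions."""
    if not a and not b:
        return 1.0
    return 2 * len(a & b) / (len(a) + len(b))


if __name__ == "__main__":
    smith = bloom_encode("Smith")
    smyth = bloom_encode("Smyth")
    jones = bloom_encode("Jones")
    print(f"Smith vs Smyth: {dice(smith, smyth):.3f}")
    print(f"Smith vs Jones: {dice(smith, jones):.3f}")
```

Because both parties only exchange the bit patterns, similar names still yield similar filters, while the plaintext bigrams are not revealed directly. In this sketch a shared secret key is what keeps a third party from recomputing the filters.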