OBJECTIVE: Integration of patients' records across resources enhances analytics. To address privacy concerns, emerging strategies such as Bloom filter encodings (BFEs), enable integration while obscuring identifiers. However, recent investigations demonstrate BFEs are, in theory, vulnerable to cryptanalysis when encoded identifiers are randomly selected from a public resource. This study investigates the extent to which cryptanalysis conditions hold for (1) real patient records and (2) a countermeasure that obscures the frequencies of the identifying values in encoded datasets. DESIGN: First, to investigate the strength of cryptanalysis for real patient records, we build BFEs from identifiers in an electronic medical record system and apply cryptanalysis using identifiers in a publicly available voter registry. Second, to investigate the countermeasure under ideal cryptanalysis conditions, we compose BFEs from the identifiers that are randomly selected from a public voter registry. MEASUREMENT: We utilize precision (ie, rate of correct re-identified encodings) and computation efficiency (ie, time to complete cryptanalysis) to assess the performance of cryptanalysis in BFEs before and after application of the countermeasure. RESULTS: Cryptanalysis can achieve high precision when the encoded identifiers are composed of a random sample of a public resource (ie, a voter registry). However, we also find that the attack is less efficient and may not be practical for more realistic scenarios. By contrast, the proposed countermeasure made cryptanalysis impractical in terms of precision and efficiency. CONCLUSIONS: Performance of cryptanalysis against BFEs based on patient data is significantly lower than theoretical estimates. The proposed countermeasure makes BFEs resistant to known practical attacks.
OBJECTIVE: Integration of patients' records across resources enhances analytics. To address privacy concerns, emerging strategies such as Bloom filter encodings (BFEs), enable integration while obscuring identifiers. However, recent investigations demonstrate BFEs are, in theory, vulnerable to cryptanalysis when encoded identifiers are randomly selected from a public resource. This study investigates the extent to which cryptanalysis conditions hold for (1) real patient records and (2) a countermeasure that obscures the frequencies of the identifying values in encoded datasets. DESIGN: First, to investigate the strength of cryptanalysis for real patient records, we build BFEs from identifiers in an electronic medical record system and apply cryptanalysis using identifiers in a publicly available voter registry. Second, to investigate the countermeasure under ideal cryptanalysis conditions, we compose BFEs from the identifiers that are randomly selected from a public voter registry. MEASUREMENT: We utilize precision (ie, rate of correct re-identified encodings) and computation efficiency (ie, time to complete cryptanalysis) to assess the performance of cryptanalysis in BFEs before and after application of the countermeasure. RESULTS: Cryptanalysis can achieve high precision when the encoded identifiers are composed of a random sample of a public resource (ie, a voter registry). However, we also find that the attack is less efficient and may not be practical for more realistic scenarios. By contrast, the proposed countermeasure made cryptanalysis impractical in terms of precision and efficiency. CONCLUSIONS: Performance of cryptanalysis against BFEs based on patient data is significantly lower than theoretical estimates. The proposed countermeasure makes BFEs resistant to known practical attacks.
Authors: Rachel E Behrman; Joshua S Benner; Jeffrey S Brown; Mark McClellan; Janet Woodcock; Richard Platt Journal: N Engl J Med Date: 2011-01-12 Impact factor: 91.245
Authors: Hye-Chung Kum; Ashok Krishnamurthy; Ashwin Machanavajjhala; Michael K Reiter; Stanley Ahalt Journal: J Am Med Inform Assoc Date: 2013-11-07 Impact factor: 4.497
Authors: Abel N Kho; John P Cashy; Kathryn L Jackson; Adam R Pah; Satyender Goel; Jörn Boehnke; John Eric Humphries; Scott Duke Kominers; Bala N Hota; Shannon A Sims; Bradley A Malin; Dustin D French; Theresa L Walunas; David O Meltzer; Erin O Kaleba; Roderick C Jones; William L Galanter Journal: J Am Med Inform Assoc Date: 2015-06-23 Impact factor: 4.497
Authors: Elizabeth Ashley Durham; Murat Kantarcioglu; Yuan Xue; Csaba Toth; Mehmet Kuzu; Bradley Malin Journal: IEEE Trans Knowl Data Eng Date: 2014-12 Impact factor: 6.977
Authors: Sean Ekins; Alex M Clark; S Joshua Swamidass; Nadia Litterman; Antony J Williams Journal: J Comput Aided Mol Des Date: 2014-06-19 Impact factor: 3.686
Authors: Mindy K Ross; Javier Sanz; Brian Tep; Rob Follett; Spencer L Soohoo; Douglas S Bell Journal: Appl Clin Inform Date: 2020-11-04 Impact factor: 2.342