| Literature DB >> 24303279 |
Jaideep Vaidya1, Basit Shafiq, Xiaoqian Jiang, Lucila Ohno-Machado.
Abstract
Health care data repositories play an important role in driving progress in medical research. Finding new pathways to discovery requires having adequate data and relevant analysis. However, it is critical to ensure the privacy and security of the stored data. In this paper, we identify a dangerous inference attack against naive suppression based approaches that are used to protect sensitive information. We base our attack on the querying system provided by the Healthcare Cost and Utilization Project, though it applies in general to any medical database providing a query capability. We also discuss potential solutions to this problem.Entities:
Year: 2013 PMID: 24303279 PMCID: PMC3845790
Source DB: PubMed Journal: AMIA Jt Summits Transl Sci Proc
Figure 1:
HCUPnet released data
Figure 2:
HCUPnet inferred data
Inferring the bounds of x11
| (a) Minimization problem |
|---|
|
min :
|
| /* Variable bounds */ |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
all xij are integral values
|
| Race/Ethnicity | ||||||||
|---|---|---|---|---|---|---|---|---|
| Total | White | Black | Hispanic | Asian / Pacific Islander | Native American | Other | Missing | |
| Total number of discharges | 735 | 535 | 82 | 58 | 18 | * | 19 | 22 |
| Mean Costs | 15,101 | 14,835 | 18,903 | 16,006 | 15,628 | * | 20,774 | 9,813 |