Literature DB >> 35878030

Balancing data privacy and usability in the federal statistical system.

V Joseph Hotz1, Christopher R Bollinger2, Tatiana Komarova3, Charles F Manski4, Robert A Moffitt5, Denis Nekipelov6, Aaron Sojourner7, Bruce D Spencer8.   

Abstract

The federal statistical system is experiencing competing pressures for change. On the one hand, for confidentiality reasons, much socially valuable data currently held by federal agencies is either not made available to researchers at all or only made available under onerous conditions. On the other hand, agencies which release public databases face new challenges in protecting the privacy of the subjects in those databases, which leads them to consider releasing fewer data or masking the data in ways that will reduce their accuracy. In this essay, we argue that the discussion has not given proper consideration to the reduced social benefits of data availability and their usability relative to the value of increased levels of privacy protection. A more balanced benefit-cost framework should be used to assess these trade-offs. We express concerns both with synthetic data methods for disclosure limitation, which will reduce the types of research that can be reliably conducted in unknown ways, and with differential privacy criteria that use what we argue is an inappropriate measure of disclosure risk. We recommend that the measure of disclosure risk used to assess all disclosure protection methods focus on what we believe is the risk that individuals should care about, that more study of the impact of differential privacy criteria and synthetic data methods on data usability for research be conducted before either is put into widespread use, and that more research be conducted on alternative methods of disclosure risk reduction that better balance benefits and costs.

Entities:  

Keywords:  data access; data disclosure risk; federal statistical system

Mesh:

Year:  2022        PMID: 35878030      PMCID: PMC9351352          DOI: 10.1073/pnas.2104906119

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   12.779


  3 in total

1.  How differential privacy will affect our understanding of health disparities in the United States.

Authors:  Alexis R Santos-Lozada; Jeffrey T Howard; Ashton M Verdery
Journal:  Proc Natl Acad Sci U S A       Date:  2020-05-28       Impact factor: 11.205

2.  The Role of Chance in the Census Bureau Database Reconstruction Experiment.

Authors:  Steven Ruggles; David Van Riper
Journal:  Popul Res Policy Rev       Date:  2021-08-22

3.  The use of differential privacy for census data and its impact on redistricting: The case of the 2020 U.S. Census.

Authors:  Christopher T Kenny; Shiro Kuriwaki; Cory McCartan; Evan T R Rosenman; Tyler Simko; Kosuke Imai
Journal:  Sci Adv       Date:  2021-10-06       Impact factor: 14.136

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.