| Literature DB >> 36260399 |
Cason Schmit1, Hye-Chung Kum1, Theodoros Giannouchos1,2, Mahin Ramezani1, Qi Zheng3, Michael A Morrisey1.
Abstract
BACKGROUND: Reaping the benefits from massive volumes of data collected in all sectors to improve population health, inform personalized medicine, and transform biomedical research requires the delicate balance between the benefits and risks of using individual-level data. There is a patchwork of US data protection laws that vary depending on the type of data, who is using it, and their intended purpose. Differences in these laws challenge big data projects using data from different sources. The decisions to permit or restrict data uses are determined by elected officials; therefore, constituent input is critical to finding the right balance between individual privacy and public benefits.Entities:
Keywords: big data; conjoint analysis; health policy; information dissemination; law; medical informatics; privacy; public health; public policy; surveys and questionnaires
Mesh:
Year: 2021 PMID: 36260399 PMCID: PMC8406123 DOI: 10.2196/25266
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 7.076
Attributes and levels for data reuse scenarios.
| Who | Purpose | Source of identifiable data |
| Researcher, University | Research, scientific knowledge dissemination | Education records |
| Nonprofit Organization | Promoting population health | Health records |
| Government | Identify criminal activity | Government program or activity |
| Business | Marketing, recruitment | Economic activity, customer behavior |
Figure 1Sample pair scenario question.
Sociodemographic data, clinical characteristics, and privacy attitude scores of the participants (N=504).
| Participant characteristics | Values | Target sample percentagea,b | |||
|
| |||||
|
| 18-24 | 41 (8.1) | 13.1 | ||
|
| 25-34 | 75 (14.9) | 17.5 | ||
|
| 35-44 | 100 (19.8) | 17.5 | ||
|
| 45-54 | 101 (20.0) | 19.2 | ||
|
| 55-64 | 68 (13.5) | 15.6 | ||
|
| 65 or older | 89 (17.7) | 17.2 | ||
|
| |||||
|
| Male | 224 (44.4) | 48.5 | ||
|
| Female | 278 (55.2) | 50.5 | ||
|
| Other/prefer not to answer | 2 (0.4) | —c | ||
|
| |||||
|
| White | 315 (62.5) | 63.7 | ||
|
| African American | 77 (15.3) | 12.2 | ||
|
| Hispanic | 51 (10.1) | 16.4 | ||
|
| Asian | 46 (9.1) | 4.7 | ||
|
| Other | 15 (3.0) | 3.0 | ||
|
| |||||
|
| $20,000 or less | 103 (20.4) | 19.9 | ||
|
| $20,000 to $49,999 | 149 (29.6) | 30.6 | ||
|
| $50,000 to $99,999 | 137 (27.2) | 29.1 | ||
|
| $100,000 to $149,999 | 67 (13.3) | 12.0 | ||
|
| $150,000 or more | 48 (9.5) | 8.3 | ||
|
| |||||
|
| High school or less | 172 (34.1) | 32.0 | ||
|
| Some college completed | 99 (19.6) | 19.0 | ||
|
| College degree | 191 (37.9) | 31.0 | ||
|
| Master’s | 37 (7.3) | — | ||
|
| PhD/doctoral | 5 (1.0) | — | ||
|
| |||||
|
| Midwest | 95 (18.8) | 22.0 | ||
|
| Northeast | 126 (25.0) | 18.2 | ||
|
| South | 174 (34.5) | 36.2 | ||
|
| West | 109 (21.6) | 23.6 | ||
|
| |||||
|
| Private | 169 (33.5) | 64.7 | ||
|
| Medicare | 112 (22.2) | 17.7 | ||
|
| Medicaid | 83 (16.5) | 17.9 | ||
|
| Uninsured | 52 (10.3) | 8.5 | ||
|
| VA/TRICARE | 10 (2.0) | 3.6 | ||
|
| Multiple | 78 (15.5) | 14.5 | ||
|
| |||||
|
| No | 319 (63.3) | — | ||
|
| Yes | 181 (35.9) | — | ||
|
| |||||
|
| No | 93 (18.5) | — | ||
|
| Yes | 256 (50.8) | — | ||
|
| |||||
|
| No | 404 (80.2) | — | ||
|
| Yes | 100 (19.8) | — | ||
|
| |||||
|
| No | 423 (83.9) | — | ||
|
| Yes | 77 (15.3) | — | ||
| Concern for information privacy scores, mean (SD) | 5.8 (1.1) | — | |||
aSurvey sampling targets based on census data.
bInsurance data were not used as the sampling target. These data show 2018 insurance statistics from the US census for survey sampling comparisons [45]. Our survey solicited mutually exclusive responses in contrast to the US census data, which do not exclude persons with multiple insurance types from these groups.
cNot available.
Figure 2Relative importance by level within each attribute in percentage (SD).
Figure 3Public preferences for use of data by users and purpose in percentage (SD). Our survey did not pair “for-profit” purposes with government or nonprofit users because these pairings were implausible and likely to confuse survey respondents.
Figure 4Top 10 and bottom 10 ranked data use scenarios derived from the sum of scenario attributes' relative values (who/use purpose/data source) in percentage (SD).