Veronica Yank1, Sanjhavi Agarwal1, Pooja Loftus1, Steven Asch1, David Rehkopf1. 1. Veronica Yank and Sanjhavi Agarwal are with the Division of General Internal Medicine, University of California, San Francisco. Pooja Loftus is with the Division of General Medical Disciplines, Stanford University, Stanford, CA. Steven Asch is with the VA Palo Alto Health Care System, Palo Alto, CA, and the Division of General Medical Disciplines, Stanford University. David Rehkopf is with the Division of General Medical Disciplines, Stanford University.
Abstract
OBJECTIVES: To determine the generalizability of crowdsourced, electronic health data from self-selected individuals using a national survey as a reference. METHODS: Using the world's largest crowdsourcing platform in 2015, we collected data on characteristics known to influence cardiovascular disease risk and identified comparable data from the 2013 Behavioral Risk Factor Surveillance System. We used age-stratified logistic regression models to identify differences among groups. RESULTS: Crowdsourced respondents were younger, more likely to be non-Hispanic and White, and had higher educational attainment. Those aged 40 to 59 years were similar to US adults in the rates of smoking, diabetes, hypertension, and hyperlipidemia. Those aged 18 to 39 years were less similar, whereas those aged 60 to 75 years were underrepresented among crowdsourced respondents. CONCLUSIONS: Crowdsourced health data might be most generalizable to adults aged 40 to 59 years, but studies of younger or older populations, racial and ethnic minorities, or those with lower educational attainment should approach crowdsourced data with caution. Public Health Implications. Policymakers, the national Precision Medicine Initiative, and others planning to use crowdsourced data should take explicit steps to define and address anticipated underrepresentation by important population subgroups.
OBJECTIVES: To determine the generalizability of crowdsourced, electronic health data from self-selected individuals using a national survey as a reference. METHODS: Using the world's largest crowdsourcing platform in 2015, we collected data on characteristics known to influence cardiovascular disease risk and identified comparable data from the 2013 Behavioral Risk Factor Surveillance System. We used age-stratified logistic regression models to identify differences among groups. RESULTS: Crowdsourced respondents were younger, more likely to be non-Hispanic and White, and had higher educational attainment. Those aged 40 to 59 years were similar to US adults in the rates of smoking, diabetes, hypertension, and hyperlipidemia. Those aged 18 to 39 years were less similar, whereas those aged 60 to 75 years were underrepresented among crowdsourced respondents. CONCLUSIONS: Crowdsourced health data might be most generalizable to adults aged 40 to 59 years, but studies of younger or older populations, racial and ethnic minorities, or those with lower educational attainment should approach crowdsourced data with caution. Public Health Implications. Policymakers, the national Precision Medicine Initiative, and others planning to use crowdsourced data should take explicit steps to define and address anticipated underrepresentation by important population subgroups.
Authors: Patricia A Cavazos-Rehg; Melissa J Krauss; Edward L Spitznagel; Ashley Lowery; Richard A Grucza; Frank J Chaloupka; Laura Jean Bierut Journal: Tob Control Date: 2014-02-05 Impact factor: 7.552
Authors: Arjun K Manrai; Birgit H Funke; Heidi L Rehm; Morten S Olesen; Bradley A Maron; Peter Szolovits; David M Margulies; Joseph Loscalzo; Isaac S Kohane Journal: N Engl J Med Date: 2016-08-18 Impact factor: 91.245
Authors: Daniel Capurro; Kate Cole; Maria I Echavarría; Jonathan Joe; Tina Neogi; Anne M Turner Journal: J Med Internet Res Date: 2014-03-14 Impact factor: 5.428
Authors: Michelle Jeong; Dongyu Zhang; Jennifer C Morgan; Jennifer Cornacchione Ross; Amira Osman; Marcella H Boynton; Jennifer R Mendel; Noel T Brewer Journal: Ann Behav Med Date: 2019-03-28
Authors: Kelsey Moriarty; Susan M Wolf; Patricia M Veach; Bonnie LeRoy; Ian M MacFarlane; Heather A Zierhut Journal: Per Med Date: 2020-08-17 Impact factor: 2.512
Authors: Mutlu Demirer; Sema Candemir; Matthew T Bigelow; Sarah M Yu; Vikash Gupta; Luciano M Prevedello; Richard D White; Joseph S Yu; Rainer Grimmer; Michael Wels; Andreas Wimmer; Abdul H Halabi; Alvin Ihsani; Thomas P O'Donnell; Barbaros S Erdal Journal: Radiol Artif Intell Date: 2019-11-27