John D Kraemer1, Andrew A Strasser2, Eric N Lindblom3, Raymond S Niaura4, Darren Mays5. 1. Department of Health Systems Administration, School of Nursing & Health Studies, Georgetown University, Washington, DC, United States. 2. Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States. 3. O'Neill Institute for National and Global Health Law, Georgetown University Law Center, Washington, DC, United States. 4. Schroeder Institute for Tobacco Research and Policy Studies, Truth Initiative, Washington, DC, United States; Department of Health, Behavior, and Society, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, United States; Department of Oncology, Georgetown University Medical Center, Cancer Prevention & Control Program, Lombardi Comprehensive Cancer Center, Washington, DC, United States. 5. Department of Oncology, Georgetown University Medical Center, Cancer Prevention & Control Program, Lombardi Comprehensive Cancer Center, Washington, DC, United States. Electronic address: dmm239@georgetown.edu.
Abstract
INTRODUCTION: Internet-based crowdsourcing is increasingly used for social and behavioral research in public health, however the potential generalizability of crowdsourced data remains unclear. This study assessed the population representativeness of Internet-based crowdsourced data. METHODS: A total of 3999 U.S. young adults ages 18 to 30years were recruited in 2016 through Internet-based crowdsourcing to complete measures taken from the 2012-2013 National Adult Tobacco Survey (NATS). Post-hoc sampling weights were created using procedures similar to the NATS. Weighted analyses were conducted in 2016 to compare crowdsourced and publicly-available 2012-2013 NATS data on demographics, tobacco use, and measures of tobacco perceptions and product warning label exposure. RESULTS: Those in the crowdsourced sample were less likely to report an annual household income of $50,000 or greater, and e-cigarette, waterpipe, and cigar use were more prevalent in the crowdsourced sample. High proportions of both samples indicated cigarette smoking is very harmful and very addictive. Comparable proportions of non-smokers and smokers reported cigarette warning label exposure, however the likelihood of reporting that smoking is very harmful by frequency of warning label exposure was lower among smokers in the crowdsourced sample. CONCLUSIONS: Our findings indicate that crowdsourced samples may differ demographically and may not produce generalizable estimates of tobacco use prevalence relative to population data after post-hoc sample weighting. However, correlational analyses in crowdsourced samples may reasonably approximate population data. Future studies can build from this work by testing additional methodological strategies to improve crowdsourced sampling strategies.
INTRODUCTION: Internet-based crowdsourcing is increasingly used for social and behavioral research in public health, however the potential generalizability of crowdsourced data remains unclear. This study assessed the population representativeness of Internet-based crowdsourced data. METHODS: A total of 3999 U.S. young adults ages 18 to 30years were recruited in 2016 through Internet-based crowdsourcing to complete measures taken from the 2012-2013 National Adult Tobacco Survey (NATS). Post-hoc sampling weights were created using procedures similar to the NATS. Weighted analyses were conducted in 2016 to compare crowdsourced and publicly-available 2012-2013 NATS data on demographics, tobacco use, and measures of tobacco perceptions and product warning label exposure. RESULTS: Those in the crowdsourced sample were less likely to report an annual household income of $50,000 or greater, and e-cigarette, waterpipe, and cigar use were more prevalent in the crowdsourced sample. High proportions of both samples indicated cigarette smoking is very harmful and very addictive. Comparable proportions of non-smokers and smokers reported cigarette warning label exposure, however the likelihood of reporting that smoking is very harmful by frequency of warning label exposure was lower among smokers in the crowdsourced sample. CONCLUSIONS: Our findings indicate that crowdsourced samples may differ demographically and may not produce generalizable estimates of tobacco use prevalence relative to population data after post-hoc sample weighting. However, correlational analyses in crowdsourced samples may reasonably approximate population data. Future studies can build from this work by testing additional methodological strategies to improve crowdsourced sampling strategies.
Authors: Noel T Brewer; Jennifer C Morgan; Sabeeh A Baig; Jennifer R Mendel; Marcella H Boynton; Jessica K Pepper; M Justin Byron; Seth M Noar; Robert P Agans; Kurt M Ribisl Journal: Tob Control Date: 2016-12-06 Impact factor: 7.552
Authors: Jennifer L Pearson; Amanda Richardson; Shari P Feirman; Andrea C Villanti; Jennifer Cantrell; Amy Cohn; Michael Tacelosky; Thomas R Kirchner Journal: Nicotine Tob Res Date: 2016-05-10 Impact factor: 4.244
Authors: Karin A Kasza; Bridget K Ambrose; Kevin P Conway; Nicolette Borek; Kristie Taylor; Maciej L Goniewicz; K Michael Cummings; Eva Sharma; Jennifer L Pearson; Victoria R Green; Annette R Kaufman; Maansi Bansal-Travers; Mark J Travers; Jonathan Kwan; Cindy Tworek; Yu-Ching Cheng; Ling Yang; Nikolas Pharris-Ciurej; Dana M van Bemmel; Cathy L Backinger; Wilson M Compton; Andrew J Hyland Journal: N Engl J Med Date: 2017-01-26 Impact factor: 91.245
Authors: Israel T Agaku; Brian A King; Corinne G Husten; Rebecca Bunnell; Bridget K Ambrose; S Sean Hu; Enver Holder-Hayes; Hannah R Day Journal: MMWR Morb Mortal Wkly Rep Date: 2014-06-27 Impact factor: 17.586
Authors: Jessica L King; Allison Lazard; Beth A Reboussin; Leah Ranney; Jennifer Cornacchione Ross; Kimberly G Wagoner; Erin L Sutfin Journal: Nicotine Tob Res Date: 2020-04-21 Impact factor: 4.244
Authors: Matthew G Kirkpatrick; Tess Boley Cruz; Jennifer B Unger; Josseline Herrera; Sara Schiff; Jon-Patrick Allem Journal: Drug Alcohol Depend Date: 2019-06-08 Impact factor: 4.492
Authors: Lilianna Phan; Darren Mays; Kenneth P Tercyak; Andrea C Johnson; Kathryn Rehberg; Isaac M Lipkus Journal: Transl Behav Med Date: 2021-02-11 Impact factor: 3.046
Authors: Michelle Jeong; Dongyu Zhang; Jennifer C Morgan; Jennifer Cornacchione Ross; Amira Osman; Marcella H Boynton; Jennifer R Mendel; Noel T Brewer Journal: Ann Behav Med Date: 2019-03-28
Authors: Andrea C Johnson; Isaac Lipkus; Kenneth P Tercyak; George Luta; Kathryn Rehberg; Lilianna Phan; Lorien C Abroms; Darren Mays Journal: Health Educ Behav Date: 2019-12