Sean D Young1, Caitlin Rivers2, Bryan Lewis2. 1. Department of Family Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA. Electronic address: sdyoung@mednet.ucla.edu. 2. Network Dynamics and Simulation Science Laboratory, Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, VA, USA.
Abstract
OBJECTIVE: Recent availability of "big data" might be used to study whether and how sexual risk behaviors are communicated on real-time social networking sites and how data might inform HIV prevention and detection. This study seeks to establish methods of using real-time social networking data for HIV prevention by assessing 1) whether geolocated conversations about HIV risk behaviors can be extracted from social networking data, 2) the prevalence and content of these conversations, and 3) the feasibility of using HIV risk-related real-time social media conversations as a method to detect HIV outcomes. METHODS: In 2012, tweets (N=553,186,061) were collected online and filtered to include those with HIV risk-related keywords (e.g., sexual behaviors and drug use). Data were merged with AIDSVU data on HIV cases. Negative binomial regressions assessed the relationship between HIV risk tweeting and prevalence by county, controlling for socioeconomic status measures. RESULTS: Over 9800 geolocated tweets were extracted and used to create a map displaying the geographical location of HIV-related tweets. There was a significant positive relationship (p<.01) between HIV-related tweets and HIV cases. CONCLUSION: Results suggest the feasibility of using social networking data as a method for evaluating and detecting Human immunodeficiency virus (HIV) risk behaviors and outcomes.
OBJECTIVE: Recent availability of "big data" might be used to study whether and how sexual risk behaviors are communicated on real-time social networking sites and how data might inform HIV prevention and detection. This study seeks to establish methods of using real-time social networking data for HIV prevention by assessing 1) whether geolocated conversations about HIV risk behaviors can be extracted from social networking data, 2) the prevalence and content of these conversations, and 3) the feasibility of using HIV risk-related real-time social media conversations as a method to detect HIV outcomes. METHODS: In 2012, tweets (N=553,186,061) were collected online and filtered to include those with HIV risk-related keywords (e.g., sexual behaviors and drug use). Data were merged with AIDSVU data on HIV cases. Negative binomial regressions assessed the relationship between HIV risk tweeting and prevalence by county, controlling for socioeconomic status measures. RESULTS: Over 9800 geolocated tweets were extracted and used to create a map displaying the geographical location of HIV-related tweets. There was a significant positive relationship (p<.01) between HIV-related tweets and HIV cases. CONCLUSION: Results suggest the feasibility of using social networking data as a method for evaluating and detecting Human immunodeficiency virus (HIV) risk behaviors and outcomes.
Authors: Sean D Young; William G Cumberland; Sung-Jae Lee; Devan Jaganath; Greg Szekeres; Thomas Coates Journal: Ann Intern Med Date: 2013-09-03 Impact factor: 25.391
Authors: Judy Gold; Alisa E Pedrana; Rachel Sacks-Davis; Margaret E Hellard; Shanton Chang; Steve Howard; Louise Keogh; Jane S Hocking; Mark A Stoove Journal: BMC Public Health Date: 2011-07-21 Impact factor: 3.295
Authors: Patrick S Sullivan; Christine M Khosropour; Nicole Luisi; Matthew Amsden; Tom Coggia; Gina M Wingood; Ralph J DiClemente Journal: J Med Internet Res Date: 2011-05-13 Impact factor: 5.428
Authors: Ian W Holloway; Shannon Dunlap; Homero E Del Pino; Keith Hermanstyne; Craig Pulsipher; Raphael J Landovitz Journal: Curr Addict Rep Date: 2014-09
Authors: Xinzhi Zhang; Eliseo J Pérez-Stable; Philip E Bourne; Emmanuel Peprah; O Kenrik Duru; Nancy Breen; David Berrigan; Fred Wood; James S Jackson; David W S Wong; Joshua Denny Journal: Ethn Dis Date: 2017-04-20 Impact factor: 1.847