| Literature DB >> 35194384 |
Maud Reveilhac1, Stephanie Steinmetz1, Davide Morselli1,2.
Abstract
In this article, we review existing research on the complementarity of social media data and survey data for the study of public opinion. We start by situating our review in the extensive literature (N = 187) about the uses, challenges, and frameworks related to the use of social media for studying public opinion. Based on 187 relevant articles (141 empirical and 46 theoretical) - we identify within the 141 empircal ones six main research approaches concerning the complementarity of both data sources. Results show that the biggest share of the research has focused on how social media can be used to confirm survey findings, especially for election predictions. The main contribution of our review is to detail and classify other growing complementarity approaches, such as comparing both data sources on a given phenomenon, using survey measures as a proxy in social media research, enriching surveys with SMD, recruiting individuals on social media to conduct a second survey phase, and generating new insight on "old" or "under-investigated" topics or theories using SMD. We discuss the advantages and disadvantages associated with each of these approaches in relation to four main research purposes, namely the improvement of validity, sustainability, reliability, and interpretability. We conclude by discussing some limitations of our study and highlighting future paths for research.Entities:
Keywords: Data complementarity; Public opinion; Social media; Survey data
Year: 2022 PMID: 35194384 PMCID: PMC8853237 DOI: 10.1007/s11042-022-12101-0
Source DB: PubMed Journal: Multimed Tools Appl ISSN: 1380-7501 Impact factor: 2.577
Differences between SMD and survey data to study PO
| SM | Survey | |
|---|---|---|
| Type of population | Selective | Representative |
| Data signals | Platform users and their signals (e.g. posts or tweets, #hashtags, @mentions, retweets, replies) | Opinion survey of individuals with a defined sampling frame |
| Types of data | Unstructured texts containing opinions and opinion strength or merely information (e.g. links) | Structured opinions measured by answers to pre-defined and pre-tested survey questions (scales) |
| Unit of observation or the level at which data are collected | Can be any of the following: users, search queries or keywords, #hashtags or @mentions, retweets, or replies, likes or emotional reactions, location | Individuals from a sampling frame representing a target population |
| Unit of analysis or level at which the data are analysed | Can be any of the following: users, location, texts from a specific topic or sentiment, overall texts, links, or other metadata | Individuals’ responses to survey items or aggregated responses at the country, region or household level |
| Meta-data | Set of users’ behavioural information (e.g. network, frequency of use, interactions) and contextual information (e.g. time and location) | Precise and quasi-complete socio-demographic information on individuals and auxiliary data (e.g. number of contact attempts, number of persons in the household) |
Fig. 1Number of empirical and theoretical articles according to our meta-review of the existing literature using surveys and SMD
List of theoretical papers focusing on the combination of survey and social media data (n = 46, continues next pages)
| Author(s) | Date | Title | Source | Focus |
|---|---|---|---|---|
| Blumenthal | 2005 | The Public Opinion Quarterly | General | |
| Gayo-Avello | 2011 | Communications of the ACM | Prediction | |
| Sobkowicz et al. | 2012 | Government Information Quarterly | Prediction | |
| Boyd & Crawford | 2012 | Information, Communication & Society | General | |
| Metaxas & Mustafaraj | 2012 | Science | Prediction | |
| Gayo-Avello | 2012 | arXiv Preprint | Prediction | |
| Gayo-Avello | 2012 | IEEE Internet Computing | Prediction | |
| Smith | 2013 | International Journal of Public Opinion Research | Ontology | |
| Couper | 2013 | Survey Research Methods | General | |
| Baker et al. | 2013 | Journal of Survey Statistics and Methodology | General | |
| Schoen et al. | 2013 | Internet Research | Prediction | |
| Gayo-Avello | 2013 | Social Science Computer Review | Prediction | |
| Stieglitz & Dang-Xuan | 2013 | Social Network Analysis and Mining volume | General | |
| Gayo-Avello et al. | 2013 | Internet Research | Prediction | |
| Ruths & Pfeffer | 2014 | Science | Behaviour | |
| Tufekci | 2014 | arXiv Preprint | Error sources | |
| Murphy et al. | 2014 | Book chapter (1): Social Media, Sociality, and Survey Research | Ontology | |
| Murphy et al. | 2014 | Public Opinion Quarterly | General | |
| Hill & Dever | 2014 | Book chapter (12): Social Media, Sociality, and Survey Research | General | |
| Tang et al. | 2014 | ACM SIGKDD Explorations Newsletter | Ontology | |
| Zagheni & Weber | 2015 | International Journal of Manpower | Demographic | |
| Resnick et al. | 2015 | The American Academy of Political and Social Science | General | |
| Ampofo et ak. | 2015 | Book chapter (8): Innovations in Digital Research Methods | Ontology | |
| Hargittai | 2015 | The American Academy of Political and Social Science | General | |
| Schober et al. | 2016 | Public Opinion Quarterly | General | |
| Olteanu et al. | 2016 | Frontiers in Big Data | Error sources | |
| Junngherr | 2016 | Journal of Information Technology & Politics | Prediction | |
| Spiro | 2016 | Current Opinion in Psychology | Linking | |
| RJ Dalton | 2016 | International Journal of Sociology | Behaviour | |
| Johnson & Smith | 2017 | Book chapter: Seeing Cities Through Big Data | Linking | |
| Hsieh & Murphy | 2017 | Book chapter (2): Total Survey Error in Practice | Error sources | |
| Salleh | 2017 | Science | General | |
| Pal | 2017 | Current Opinion in Behavioral Sciences | Small data | |
| Salunkhe et al. | 2017 | International Journal of Advanced Research in Science, Communication and Technology | Prediction | |
| Klašnja et al. | 2018 | Book chapter: The Oxford Handbook of Polling and Survey Methods | General | |
| Jungherr | 2018 | Book chapter: Digital Discussions | General | |
| Kwak & Cho | 2018 | Asian Journal for Public Opinion Research | Prediction | |
| Szreder | 2018 | Archives of Data Science | General | |
| Freelon | 2019 | Book chapter: Digital Discussions | Demographic | |
| Trottier | 2019 | Fast Capitalism | General | |
| Sen et al. | 2019 | arXiv Preprint | Error sources | |
| Salvatore et al. | 2020 | Social Indicators Research | Error sources | |
| Stier et al. | 2020 | Social Science Computer Review | Linking | |
| Romele et al. | 2020 | AI & SOCIETY | Ontology | |
| Skoric et al. | 2020 | Information | Prediction | |
| Rousidis et al. | 2020 | Multimedia Tools and Applications | Prediction | |
| Chauhan et al. | 2020 | Journal of Ambient Intelligence and Humanized Computing | Prediction |
Fig. 2Complementary approaches using SMD and survey data for the study of PO
List of publications combining social media and survey data for prediction purposes (n = 48, continues next pages)
| Author(s) | Date | Title | Source | SM | Topic | Level of analysis |
|---|---|---|---|---|---|---|
| Tumasjan et al. | 2011 | Social Science Computer Review | election | national | ||
| Aparaschivei | 2011 | Journal of Media Research-Revista de Studii Media | Twitter & Facebook & Youtube | election | national | |
| Jungherr et al. | 2012 | Social science computer review | election | national | ||
| Borondo et al. | 2012 | Chaos An Interdisciplinary Journal of Nonlinear Science | election | national | ||
| Jungherr et al. | 2012 | Social science computer review | election | national | ||
| Choy et al. | 2012 | arXiv Preprint | election | national | ||
| Borondo et al. | 2012 | Chaos: An Interdisciplinary Journal of Nonlinear Science | election | national | ||
| González-Bailón et al. | 2012 | Human Communication Research | other social media (Usenet) | presidential approval | national | |
| Franch | 2013 | Journal of Information Technology & Politics | Facebook, Twitter, Google, & YouTube | election | national | |
| Kermanidis & Maragoudakis | 2013 | Int. J. Social Network Mining | election | national | ||
| Fu & Chan | 2013 | Cyberpsychology, Behavior, and Social Networking | online discussion forums, personal blogs, and microblogs | election | local | |
| DiGrazia et al. | 2013 | PloS one | politics | district | ||
| Paul & Dredze | 2014 | PloS one | health | national | ||
| Cheng & Chen | 2014 | Aslib Journal of Information Management | politics | regional | ||
| Ceron et al. | 2014 | New media & society | politics | national | ||
| Ceron et al. | 2015 | Social Science Computer Review | election | national | ||
| Murthy | 2015 | Information, Communication & Society | election | State | ||
| Eom et al. | 2015 | PloS one | election | national | ||
| Wong et al. | 2015 | Journal of medical internet research | health | national | ||
| Durahim & Coşkun | 2015 | Technological Forecasting and Social Change | well-being | national & regional | ||
| Huberty | 2015 | International Journal of Forecasting | election | national | ||
| Jungherr et al. | 2015 | Social Science Computer Review | election | national | ||
| Burnap et al. | 2016 | Electoral Studies | election | national | ||
| Cody et al. | 2016 | arXiv Preprint | presidential approval | national | ||
| Beauchamp | 2017 | American Journal of Political Science | election | State | ||
| Yaqub et al. | 2017 | Government Information Quarterly | election | national | ||
| Lopez et al. | 2017 | Statistics, Politics and Policy | Brexit | national | ||
| Vepsäläinen et al. | 2017 | Government Information Quarterly | election | national | ||
| Feng et al. | 2017 | Tobacco Control | health | national | ||
| Kristensen et al. | 2017 | PloS one | politics | national | ||
| Beauchamp | 2017 | American Journal of Political Science | election | national | ||
| Oliveira et al. | 2017 | Journal of Information Technology & Politics | election | national | ||
| Masoomali et al. | 2018 | World Development | equality (gender, immigration, LGB, etc.) | international | ||
| Bastos & Mercea | 2018 | Information, Communication & Society | Brexit | constituencies | ||
| Chmielewska-Szlajfer | 2018 | Media, Culture & Society | election | national | ||
| Pasek et al. | 2018 | Public Opinion Quarterly | economic satisfaction | national | ||
| Heredia et al. | 2018 | Social Network Analysis and Mining | election | national | ||
| Zhang | 2018 | PloS one | election | national | ||
| Bansal & Srivastava | 2018 | Procedia Computer Science | election | State | ||
| Grimaldi | 2019 | Social Network Analysis and Mining | election | national | ||
| Awais et al. | 2019 | Journal of Ambient Intelligence and Humanized Computing | election | national | ||
| Pasek et al. | 2019 | Social Science Computer Review | presidential approval | national | ||
| Jaidka et al. | 2019 | Asian Journal of Communication | election | international | ||
| Pasek et al. | 2020 | Social Science Computer Review | presidential approval | national | ||
| Chin & Wang | 2020 | Journal of Forecasting | election | County & city | ||
| Stieglitz et al. | 2020 | Information Systems Frontiers | election | international | ||
| Gong et al. | 2020 | Plos one | election | State | ||
| Sepúlveda & Norambuena | 2020 | Intelligent Data Analysis | election | national |
List of publications combining social media and survey data for enrichment purposes (n = 9)
| Author(s) | Date | Title | Source | SM | Topic | Level of analysis |
|---|---|---|---|---|---|---|
| Vaccari et al. | 2015 | Journal of Computer-Mediated Communication | politics | national | ||
| Karlsen & Enjolras | 2016 | The International Journal of Press/Politics | election | national | ||
| Hofstra et al. | 2017 | American Sociological Review | equality (gender, immigration, LGB, etc.) | national | ||
| Quinlan et al. | 2018 | Information, Communication & Society | Twitter & Facebook | political communication | national | |
| Stier et al. | 2018 | Political communication | Twitter & Facebook | politics | national | |
| Cardenal et al. | 2019 | International Journal of Public Opinion Research | politics | national | ||
| Jacbs & Spierings | 2019 | Information, Communication & Society | politics | national | ||
| De Sio & Weber | 2020 | West European Politics | politics | international | ||
| Shin | 2020 | Social Media + Society | politics | national |
List of publications using survey as proxy with social media data (n = 18, continues next page)
| Author(s) | Date | Title | Source | SM | Topic | Level of analysis |
|---|---|---|---|---|---|---|
| Vaccari & Nielsen | 2013 | Journal of Information Technology & Politics | Facebook, Twitter, and YouTube | election | national | |
| LaMarre & Suzuki-Lambrecht | 2013 | Public relations review | election | users | ||
| Jensen & Anstead | 2013 | Policy & Internet | election | State | ||
| Larsson | 2015 | Journal of Information Technology & Politics | political communication | international | ||
| Theocharis et al. | 2016 | Journal of Communication | politics | national | ||
| Ceron & d’Adda | 2016 | New media & society | politics | national | ||
| Park et al. | 2017 | PLoS one | Youtube | other | international | |
| Ernst et al. | 2017 | Information, Communication & Society | Twitter & Facebook | politics | national | |
| Stier et al. | 2018 | Political communication | Twitter & Facebook | political communication | national | |
| Barberá & Zeitzoff | 2018 | International Studies Quarterly | Twitter & Facebook | political communication | international | |
| Rossini et al. | 2018 | Journal of Information Technology & Politics | Twitter & Facebook | political communication | national | |
| Rossini et al. | 2018 | Social Media | Social Media + Society | Twitter & Facebook | political communication | national |
| Rossini et al. | 2018 | Social Media + Society | Twitter & Facebook | political communication | national | |
| Plescia et al. | 2019 | Representation | politics | international | ||
| Wells et al. | 2020 | New Media & Society | politics | national | ||
| Lazarus & Thornton | 2020 | Social Science Computer Review | politics | national | ||
| Daniel & Obholzer | 2020 | Research & Politics | political communication | international | ||
| Eberl et al. | 2020 | Journal of Information Technology & Politics | politics | national |
List of publications combining social media and survey data for comparison purposes (n = 26, continues next pages)
| Author(s) | Date | Title | Source | SM | Topic | Level of analysis |
|---|---|---|---|---|---|---|
| King et al. | 2013 | Health policy | health | national | ||
| Tsou et al. | 2013 | Cartography and Geographic Information Science | Twitter (+ web pages) | election | national | |
| Kim et al. | 2013 | Book chapter (3): Social Media, Sociality, and Survey Research | politics | national | ||
| Ceron et al. | 2014 | New media & society | election | national | ||
| Barry | 2014 | Environmental management | FlickrTM (pictures+comments) | climate & environment & energy | national | |
| Van Dalen et al. | 2015 | Journal of Information Technology & Politics | politics | national | ||
| Jungherr et al. | 2016 | Journal of Computer-Mediated Communication | election | national | ||
| Bhattacharya et al. | 2016 | Journal of the Association for Information Science and Technology | politics | national | ||
| Diaz et al. | 2017 | PloS one | politics | international | ||
| Grčar et al. | 2017 | Computational social networks | Brexit | national | ||
| Davis et al. | 2017 | Journal of medical Internet research | health | national | ||
| Heikinheimo et al. | 2017 | User-generated geographic information for visitor monitoring in a national park: A comparison of social media data and visitor survey | International Journal of Fgeo-Information | other | regional | |
| Bajaj | 2017 | Asian Survey | political communication | national | ||
| Wang et al. | 2018 | Sustainability | other social media | other | local | |
| Farhadloo et al. | 2018 | JMIR Public Health Surveillance | health | national | ||
| Howell et al. | 2018 | Politics and the Life Sciences | health | national | ||
| Wainger et al. | 2018 | Ecological Economics | climate & environment & energy | national | ||
| Nawa et al. | 2018 | American Journal of Transplantation | health | national | ||
| Scarborough | 2018 | Socius | equality (gender, immigration, LGB, etc.) | region & State & national | ||
| Mancosu & Bobba | 2019 | PloS one | Facebbok | politics | national | |
| Merkley et al. | 2020 | Canadian Journal of Political Science | Twitter (+GoogleTrend) | health | national | |
| Loureiro & Alló | 2020 | Energy Policy | climate & environment & energy | international | ||
| Amaya et al. | 2020 | book chapter (5): Big Data Meets Survey Science: A Collection of Innovative Methods | health | national | ||
| Klingeren et al. | 2020 | Public opinion on Twitter? How vote choice and arguments on Twitter comply with patterns in survey data, evidence from the 2016 Ukraine referendum in the Netherlands | Acta Politica | politics | national | |
| Pasek et al. | 2020 | book chapter (6): Big Data Meets Survey Science: A Collection of Innovative Methods | campaign events | national | ||
| Nowak et al. | 2020 | Comparing covariation among vaccine hesitancy and broader beliefs within Twitter and survey data | PloS one | health | national |
List of publications combining social media and survey data for generating new insights (n = 32, continues next pages)
| Author(s) | Date | Title | Source | SM | Topic | Reason to complement | Level of analysis |
|---|---|---|---|---|---|---|---|
| Ampofo et al. | 2011 | Information, Communication & Society | election | what citizens think about surveys | national | ||
| Robillard et al. | 2013 | Journal of Medical Internet Research | Yahoo! Answers | health | capture emergent opinions | international | |
| Cavazos-Rehg et al. | 2014 | Journal of Medical Internet Research | health | capture emergent opinions | international | ||
| Russell Neuman et al. | 2014 | Journal of Communication | Twitter, blogs, forum commentaries, and traditional media news stories | politics | alternative to self-reported measures | national | |
| Kim & Kim | 2014 | International Journal of Multimedia and Ubiquitous Engineering | climate & environment & energy | capture emergent opinions | national | ||
| Trilling | 2015 | Social science computer review | Twitter (+ transcript of TV debate) | election | more nuanced approach of PO | national | |
| Williams et al. | 2015 | Global Environmental Change | climate & environment & energy | more dynamic perspective of PO | international | ||
| Kirilenko et al. | 2015 | Global Environmental Change | climate & environment & energy | “passive survey” of PO | national & regional | ||
| Thompson et al. | 2015 | Cyberpsychology, Behavior, and Social Networking | health | capture emergent opinions | national | ||
| Sajuria & Fábrega | 2016 | Digital Methods for Social Science | politics | alternative to self-reported measures | national | ||
| Settle et al. | 2016 | Political Science Research and Methods | politics | alternative to self-reported measures | State | ||
| Marchetti & Ceccobelli | 2016 | Journalism Practice | election | more nuanced approach of PO | national | ||
| Krauss et al. | 2017 | American Journal of Health Promotion | health | capture emergent opinions | national | ||
| Barisione & Ceron | 2017 | Social Media and European Politics | economic satisfaction | national | |||
| Chan & Fu | 2017 | Journal of Computer-Mediated Communication | politics | more dynamic perspective of PO | city (Hong Kong) | ||
| Flores | 2017 | American Journal of Sociology | equality (gender, immigration, LGB, etc.) | causal inference | State | ||
| Stautz et al. | 2017 | BMJ open | health | more nuanced approach of PO | national | ||
| Chadwick & Dennis | 2017 | Social media | Political Studies | Twitter (+ campaign emails & online news articles) | politics | more nuanced approach of PO | national |
| Etter at al. | 2018 | Business & Society | economic satisfaction | expand the scope of survey focus | national | ||
| Karami et al. | 2018 | International Journal of Strategic Decision Sciences | election | more nuanced approach of PO | national | ||
| Clark et al. | 2018 | Journal of Law and Courts | equality (gender, immigration, LGB, etc.) | expand the scope of survey focus | national | ||
| Couper et al. | 2019 | Biological Conservation | Twitter (+ GoogleSearches & media) | climate & environment & energy | expand the scope of survey focus | International | |
| Aydogan et al. | 2019 | Journal of Representative Democracy | politics | novel approach | national | ||
| Hatipoğlu et al. | 2019 | All Azimuth: A Journal of Foreign Policy and Peace | politics | expand the scope of survey focus | national | ||
| Vidal-Alaball et al. | 2019 | JMIR Formative Research | health | validate survey measurements | international | ||
| Barberá et al. | 2019 | American Political Science Review | politics | expand the scope of survey focus | national | ||
| Dahlberg et al. | 2020 | Zeitschrift für Vergleichende Politikwissenschaft | Different social media (+ online news) | politics | validate survey measurements | international | |
| Lovari et al. | 2020 | American Behavioral Scientist | health | national | |||
| Adams-Cohen | 2020 | American Politics Research | equality (gender, immigration, LGB, etc.) | causal inference | national & State | ||
| Tavoschi et al. | 2020 | Human Vaccines & Immunotherapeutics | health | capture emergent opinions | national | ||
| Guan et al. | 2020 | International Relations of the Asia-Pacific | politics | capture emergent opinions | national | ||
| Kinra et al. | 2020 | Transport Policy | other | expand the scope of survey focus | national |
List of publications using social media as a recruitment tool (n = 8)
| Author(s) | Date | Title | Source | SM | Topic | Level of analysis |
|---|---|---|---|---|---|---|
| Bekafigo & McBride | 2013 | Social Science Computer Review | election | State | ||
| Bode & Dalrymple | 2014 | Journal of Political Marketing | politics | national | ||
| Vaccari et al. | 2014 | Rivista Italiana di Scienza Politica | politics | national | ||
| Vaccari et al. | 2015 | Journal of Communication | politics | national | ||
| Vaccari et al. | 2015 | Journal of Computer-Mediated Communication | politics | national | ||
| Kobayashi & Ichifuji | 2015 | Tweets that matter: Evidence from a randomized field experiment in Japan | Political Communication | politics | national | |
| Burke & Kraut | 2016 | The relationship between Facebook use and well-being depends on communication type and tie strength | Journal of computer-mediated communication | well-being | international | |
| Vaccari et al. | 2016 | Social Media + Society | politics | national |