| Literature DB >> 29898743 |
Andrea C Tricco1,2, Wasifa Zarin3, Erin Lillie3, Serena Jeblee4, Rachel Warren3, Paul A Khan3, Reid Robson3, Ba' Pham3, Graeme Hirst4, Sharon E Straus3,5.
Abstract
BACKGROUND: A scoping review to characterize the literature on the use of conversations in social media as a potential source of data for detecting adverse events (AEs) related to health products.Entities:
Keywords: Adverse event; Data analytics; Drug safety; Knowledge synthesis; Social media; Surveillance
Mesh:
Year: 2018 PMID: 29898743 PMCID: PMC6001022 DOI: 10.1186/s12911-018-0621-y
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Study flow diagram
Document characteristics
| Document characteristics ( | Count (%) | |
|---|---|---|
| Year of dissemination | 2001–2004 | 1 (1.4%) |
| 2005–2008 | 1 (1.4%) | |
| 2009–2012 | 13 (18.6%) | |
| 2013–2016 | 55 (78.6%) | |
| Document type | Blog | 1 (1.4%) |
| Dissertation | 1 (1.4%) | |
| Book section | 2 (2.9%) | |
| Report | 3 (4.3%) | |
| Conference paper/poster | 23 (32.9%) | |
| Journal article | 40 (57.1%) | |
| Geographic region of publication | Asia | 2 (2.9%) |
| Australia & New Zealand | 5 (7.1%) | |
| Europe | 12 (17.1%) | |
| North America | 51 (72.9%) | |
| Funding type | Non-sponsored | 4 (5.7%) |
| Industry and public-sponsored | 5 (7.1%) | |
| Industry-sponsored | 7 (10.0%) | |
| Not reported | 26 (37.1%) | |
| Public-sponsored | 28 (40.0%) | |
| Types of social media listening systems studied for drug safety surveillance | Used an available automatic information extraction system (fully developed and available for use) | 8 (11.4%) |
| Used a manual approach for information extraction | 16 (22.9%) | |
| Used an experimental automatic information extraction system (at the development stage) | 46 (65.7%) | |
Fig. 2Wordcloud of social media sources mined in the documents
Social media data characteristics
| Social media data characteristics ( | Count (%) | |
|---|---|---|
| Number of social media sources included by study authors | 1 | 38 (54.3%) |
| 2 | 10 (14.3%) | |
| 3 | 8 (11.4%) | |
| 4 | 3 (4.3%) | |
| > 5 | 7 (10.0%) | |
| Not reported | 4 (5.7%) | |
| Type of social media sites | Patient-specific | 35 (50.0%) |
| General population | 27 (38.6%) | |
| Both patient-specific and general population | 7 (10.0%) | |
| Not reported | 1 (1.4%) | |
| Region of origin of the social media posts | USA | 5 (7.1%) |
| Spain | 2 (2.9%) | |
| Germany | 1 (1.4%) | |
| 50+ countries | 1 (1.4%) | |
| USA, Canada, UK, Australia, New Zealand | 1 (1.4%) | |
| France | 1 (1.4%) | |
| UK, North America, Australasia | 1 (1.4%) | |
| Not reported | 58 (82.9%) | |
| Language of the social media posts | English | 60 (85.7%) |
| Spanish | 2 (2.9%) | |
| French | 2 (2.9%) | |
| German | 1 (1.4%) | |
| Serbian | 1 (1.4%) | |
| Multilingual | 2 (2.9%) | |
| Not reported | 2 (2.9%) | |
| Method employed to collect social media data | Web crawling/ spidering software | 19 (27.1%) |
| API of the host site | 14 (20.0%) | |
| Keyword search | 7 (10.0%) | |
| Multiple methods | 4 (5.7%) | |
| Other methods | 13 (18.6%) | |
| Not Reported | 13 (18.6%) | |
| Duration (years) of social media data | Median (Q1, Q3) | 1.13 (0.5, 7.13) |
| Not reported | 34 (48.6%) | |
| Number of social media posts retrieved | Median (Q1, Q3) | 42,594 (4608, 711,562) |
| Not reported | 5 (7.1%) | |
Health conditions, types of surveillance, and types of health products investigated
| Topic ( | Count (%) | |
|---|---|---|
| Health conditions studied as per ICD-10 | Multiple disease system | 32 (45.7%) |
| Neoplasm | 9 (12.9%) | |
| Mental illness & behavioural disorders | 8 (11.4%) | |
| Factors influencing health status & contact with health services | 6 (8.6%) | |
| Nervous system | 6 (8.6%) | |
| Endocrine, nutritional & metabolic | 3 (4.3%) | |
| Not reported | 2 (2.9%) | |
| Circulatory system | 2 (2.9%) | |
| Injury, poisoning and certain other consequences of external causes | 1 (1.4%) | |
| Skin & subcutaneous tissue | 1 (1.4%) | |
| Type of surveillance studied | Any adverse event | 52 (74.3%) |
| Drug abuse/misuse | 6 (8.6%) | |
| Drug-to-drug interaction | 4 (5.7%) | |
| Specific adverse event (e.g., arthralgia, heart diseases, infertility) | 7 (10.0%) | |
| Treatment switching | 1 (1.4%) | |
| Type of health products included | Pharmaceutical drugs (including biologics) | 69 (98.6%) |
| Medical devices | 1 (1.4%) | |
| Natural health products | 0 (0.0%) | |
Fig. 3Steps typically involved in social media data processing flow
Utility and challenges of social media listening
| Utility and challenges of social media listening | Count (%) |
|---|---|
| Utility of social media listening for pharmacovigilance | |
| Supplemental data to traditional post-marketing safety surveillance | 31 (44.3%) |
| Captures perceptions and consequences of treatment and adverse events | 14 (20.0%) |
| Large publicly available data source | 14 (20.0%) |
| Able to discover undocumented or rare adverse events | 11 (15.7%) |
| Promising early warning system | 10 (14.3%) |
| Computationally efficient | 7 (10.0%) |
| Captures prescription drug misuse/abuse | 4 (5.7%) |
| Not biased towards severe adverse events | 7 (10.0%) |
| Captures large geographical area | 3 (4.3%) |
| Useful for risk communication | 3 (4.3%) |
| Able to extract complex medical concepts | 2 (2.9%) |
| Can be more accurate than spontaneous reporting systems | 2 (2.9%) |
| Hypothesis-generating | 2 (2.9%) |
| Able to identify undocumented drug interactions | 2 (2.9%) |
| Findings are similar to traditional systems | 1 (1.4%) |
| Captures information on adherence related to adverse events | 1 (1.4%) |
| Challenges of social media listening for pharmacovigilance | |
| Non-standard reporting format (informal language, format used to report information, amount of information provided by each user) | 30 (42.9%) |
| Difficult to draw complex semantic relationships from unstructured texts | 14 (20.0%) |
| May not be a representative population | 13 (18.6%) |
| Noise may exist in signal detection | 12 (17.1%) |
| Inadequate information to draw causality | 9 (12.9%) |
| Lacks comprehensive medical and demographic information | 8 (11.4%) |
| Subjective, incomplete or misinformation | 6 (8.6%) |
| Not a balanced coverage of all drugs and medical conditions | 5 (7.1%) |
| Data acquisition challenges due to host site restrictions | 4 (5.7%) |
| Duplication of data (double-counting) | 4 (5.7%) |
| Processing multi-lingual texts | 3 (4.3%) |
| Resource-intensive to process big data | 2 (2.9%) |