| Literature DB >> 31446567 |
John van Stekelenborg1, Johan Ellenius2, Simon Maskell3,4, Tomas Bergvall2, Ola Caster2, Nabarun Dasgupta5, Juergen Dietrich6, Sara Gama7, David Lewis7,8, Victoria Newbould9, Sabine Brosch9, Carrie E Pierce10, Gregory Powell11, Alicia Ptaszyńska-Neophytou12, Antoni F Z Wiśniewski13, Phil Tregunno12, G Niklas Norén2, Munir Pirmohamed14,15.
Abstract
Over a period of 3 years, the European Union's Innovative Medicines Initiative WEB-RADR project has explored the value of social media (i.e., information exchanged through the internet, typically via online social networks) for identifying adverse events as well as for safety signal detection. Many patients and clinicians have taken to social media to discuss their positive and negative experiences of medications, creating a source of publicly available information that has the potential to provide insights into medicinal product safety concerns. The WEB-RADR project has developed a collaborative English language workspace for visualising and analysing social media data for a number of medicinal products. Further, novel text and data mining methods for social media analysis have been developed and evaluated. From this original research, several recommendations are presented with supporting rationale and consideration of the limitations. Recommendations for further research that extend beyond the scope of the current project are also presented.Entities:
Mesh:
Year: 2019 PMID: 31446567 PMCID: PMC6858385 DOI: 10.1007/s40264-019-00858-7
Source DB: PubMed Journal: Drug Saf ISSN: 0114-5916 Impact factor: 5.606
Proposed classification of social media data by potential value to pharmacovigilance
| Area | Value proposition | Examples |
|---|---|---|
| Reporting and communication | Direct interaction between interested parties Increased awareness on part of the MAH, HA patient | Provides tools to report ADRs—company product websites, Medwatch, Yellow Card Sharing experiences and practices: communities of HCPs; communities of patients Two-way communication: risk communication; information sharing |
| Signal detection | Find rare events not often reported through spontaneous reporting to HAs and pharma companies Find medical side effects earlier than in other systems across a broad spectrum Alleviate underreporting known to occur in spontaneous systems | Primary signal detection tool alongside traditional (spontaneous) sources, across all products and events |
| Niche PV in pre-specified areas | Find new information in specific niche areas underrepresented in current monitoring systems May be used as a primary tool for safety signal detection in certain pre-defined narrow areas (in contrast to broad-based safety monitoring across all products/events where social media are not value-added) | Exposure during pregnancy Abuse Misuse Low exposure, e.g., orphan drugs |
| Signal evaluation | Use for strengthening of hypotheses emerging from other systems Provide additional insight into safety issues identified through other means | Ad-hoc inspection of social media posts after a safety signal has been found in other sources |
| Quality of life | Find areas of patient and HCP concern that are not necessarily medically serious, but that have a significant impact on quality of life | Insomnia Stress Depressed mood |
ADR adverse drug reaction, HA Health Authority, HCP health care provider, MAH Marketing Authorisation Holder, PV pharmacovigilance
Recommendations relating to the role of social media data in pharmacovigilance
| Recommendation | Rationale |
|---|---|
| Social media should not be used as a source of ICSRs | With the exception of posts made by patients, carers and healthcare professionals on pharmaceutical company websites that make explicit mention of adverse events, the use of social media data for pharmacovigilance is secondary to the original intended use of these data [ |
| Facebook and Twitter are currently not worthwhile to employ for the purpose of broad-ranging statistical signal detection at the expense of other pharmacovigilance activities | Applying disproportionality-based signal detection algorithms to automatically annotated Twitter/Facebook posts did not result in any predictive ability against two reference sets of signals and non-signals, in contrast to applying disproportionality analysis to VigiBasea cases. In addition, neither the first detected Twitter or Facebook posts nor the first occurrence of disproportionality in these sources would precede the actual time point of signalling, whereas in VigiBase this was more frequent, thereby negating any timing advantage of social media. This same lack of predictive ability was encountered with a relatively small sample of patient forum posts [ |
| Future research should explore the value of social media as a source of information for additional cases in signal refinement/evaluation of ADRs that may significantly affect a patient’s quality of life | Approximately 12% of posts inspected in WEB-RADR contained information relevant to quality-of-life issues, e.g. lack of sleep, anxiety etc. [ |
| If social media is considered for use in pharmacovigilance, it is recommended that a prior assessment of the absolute and relative number of available posts related to the drug and/or event of interest in different online sources is made in relation to its intended use | There is substantial variation across drugs and adverse event terms in the amount of information in social media as well as substantial variation across different social media sources. Of the 38 medicinal products included in the WEB-RADR signal detection reference set, the range of substance mentions was from five (ranibizumab) to approximately 24,000 (methylphenidate) over a 3-year period (1 March 2012 to 31 March 2015)—see Fig. Within the data collected prospectively for WEB-RADR (acquired from September 2014 through September 2017b), products with orphan or oncology-related indications were more likely to have higher volumes of posts describing potential AEs in patient fora than in Twitter (ruxolitinib had 3 × more posts describing potential AEs in fora, nilotinib had 8 ×, tobramycin 70 × and anastrozole 85 ×). Products with psychiatric indications were more likely to have a higher volume of posts in general, as well as a higher volume of posts describing potential AEs and mentions in Twitter than in patient fora (methylphenidate – 1.5 × more posts describing AEs in Twitter, zolpidem 7 ×) [ |
| Further research should be carried out to determine whether there is value in social media data for niche areas of pharmacovigilance | WEB-RADR has demonstrated that there are niche areas of pharmacovigilance where social media data are more plentiful [ |
| Consider using a predictive algorithm to identify and eliminate from the search query any medicinal product names with high levels of ambiguity to optimise time efficiency and, where applicable, cost effectiveness | The study by Hedfors et al. [ |
ADRs adverse drug reactions, AEs adverse events, ICSRs individual case safety reports
aVigiBase is the World Health Organisation’s (WHO) global database of ICSRs maintained by the Uppsala Monitoring Centre, Uppsala, Sweden. It is the largest database of its kind in the world, with over 19 million reports of suspected adverse effects of medicines submitted since 1968 by member countries of the WHO Programme for International Drug Monitoring
bIn fact, data collection continued until December 2017; however, only data through September 2017 were included in the final report
Fig. 2Number of WEB-RADR substance mentions in Twitter/Facebook (FB) at an indicator score threshold of 0.7. Figure drawn using data from Caster et al. [15]
Fig. 1Conceptual overview of the investigation of the utility of social media in safety signal detection. (AUC area under the curve, ROC receiver operating characteristics, SD signal detection)
Recommendations relating to the use of social media for signal detection
| Recommendation | Observations |
|---|---|
| When evaluating signal detection algorithms for social media data, complement overall performance analyses (e.g. receiver operating characteristics) with manual post-level assessment as a sanity check: if a large proportion of the automatically identified posts do not contain the medicinal product or adverse event term indicated, overall results must be interpreted with considerable caution | During manual inspection of post text corresponding to a social media signal, only 39.6% of the posts contained the drug and medical event of interest as an actual adverse experience. In the subset of posts with indicator score of 0.7 or above, the corresponding result was 67.3% (72 of 107 posts) |
| In evaluation of signal detection methods, proprietary reference sets should be avoided if possible | Practically, working with our WEB-RADR SD reference set has been very cumbersome since all data extraction had to be performed locally at several different sites by those authorised to access the de-anonymised controls. Further, such a reference set cannot be critically inspected or re-used outside the specific study where it was used. Finally, certain types of analyses become impossible to perform, such as aggregation based on characteristics of the medicines or adverse event terms |
| If setting up a safety surveillance system based on social media today, it is more important to first improve and calibrate adverse event recognition than the algorithms for statistical signal detection | We have generally seen small differences between different algorithms and in our exploratory study, a more advance method like SbD provided no added benefit [ |
SbD ‘Signal Before Detect’ algorithm, SD signal detection
Recommendations relating to adverse event recognition in social media data
| Recommendation | Observations |
|---|---|
| For methods developed for AE recognition in social media, evaluate its performance on a standard reference data set such as that produced by WEB-RADR, to facilitate comparison of methods | We compared the classification performance of the NLP workflow for medicinal product–AE |
| Consider the use of machine learning technology to support the recognition of social media data relevant for pharmacovigilance | Less than 2% of tweets assessed in the development of the AE recognition reference set contain AE terms [Dietrich 2019, submitted]. A large proportion of irrelevant data will exist in any social media dataset. As such, employing automated processes may enable AE recognition whilst reducing the effort required for manual review |
| Human curation should be used in conjunction with automated processes aimed at identifying potential AEs from social media with methods available today | The NLP workflows for medicinal product–AE recognition and coding were evaluated to have a precision equal to 38%. This means that the majority of automatically recognised medicinal product–AE pairs are incorrectly classified. Human curation has the potential to detect and discard such pairs and thereby increase the precision. The content of social media posts underlying signals of disproportionate reporting (SDRs) in the signal detection study was inspected and found to be severely lacking in content and interpretability. In fact, of the posts inspected, only 40% of posts contained the correct drug and the correct event as an adverse experience, pointing to a significant issue with ADR recognition [ |
| If available, use existing mappings between verbatim text and MedDRA® terms from spontaneous reporting systems to improve sensitivity in medical event recognition for social media | Our study showed that the inclusion of historical mappings from VigiBase verbatims to a dictionary of MedDRA® LLTs almost tripled the number of captured AEs. Generalisability beyond VigiBase as a source of mappings and Twitter as the domain of application is unknown |
| Consider incorporating information on medicinal product indications in automated AE recognition, thereby reducing the likelihood of falsely categorising an indication as an AE | In the AE recognition reference set, 18.5% of patients mention indications for use of a medication in conjunction with product names and AEs [Dietrich 2019 submitted]. In addition, patients may describe symptoms of their underlying conditions and AEs in the same post, making it difficult for automated processes to determine which medical conditions or symptoms are AEs versus related to a product’s indication. Absolute removal of indication-related posts may not be beneficial or result in more accurate automated coding; for example, in a post where a patient mentions that a medical product aggravated the condition that the medicine is meant to be treating |
ADRs adverse drug reactions, AEs adverse events, LLTs Lowest Level Terms, NLP natural language processing
Recommendations relating to duplicate record detection in social media data
| Recommendation | Observations |
|---|---|
| Duplicate detection should be performed in preparing social media data for use in pharmacovigilance | Having first eliminated simple retweets etc., our study found 17% of the remaining posts to be suspected duplicates, with an algorithm that has an estimated precision of 99% [ |
| Probabilistic record linkage should be considered as a complement or alternative to rule-based methods for duplicate detection in social media data | Our study found 9% suspected duplicates in a set of Twitter posts that had already been deduplicated using a method based on rules and Bloom filters. A lower proportion of additional suspected duplicates were identified for posts related to adverse events (1.6%) [ |
| Training data for duplicate detection in social media should be enriched with suspected duplicates ensuring that the method of enrichment is accounted for in the training and evaluation of the duplicate detection method; for example, through active learning | Our study showed that it was feasible to use active learning in training vigiMatch for duplicate detection in Twitter. Only 0.008% of all the possible pairs of tweets in our data were suspected duplicates, so a straight sample would include mostly non-duplicates. If training data are enriched with suspected duplicates and algorithms are trained and evaluated without considering the method of enrichment, then the method and their estimated performance will not generalise to the real-world setting |
| Future research should compare different approaches to improve computational efficiency such as blocking and locality-sensitive hashing | Computational efficiency is of great importance in duplicate detection and a comparison between different approaches was out of scope for the study at hand. In our study, a simple blocking scheme reduced the number of pairwise comparisons by 22% [ |
| General social media, as exemplified by sample data from Facebook and Twitter, are not recommended for broad statistical signal detection. |
| Social media channels may provide a useful adjunct to pharmacovigilance activities in specific niche areas such as exposure during pregnancy and abuse/misuse of medicines. |
| Future enhancement of adverse event recognition algorithms may broaden the scope and utility of social media over time. |