| Literature DB >> 28935617 |
Cedric Bousquet1,2, Badisse Dahamna3, Sylvie Guillemin-Lanne4, Stefan J Darmoni1,3, Carole Faviez5, Charles Huot4, Sandrine Katsahian6, Vincent Leroux7, Suzanne Pereira8, Christophe Richard9, Stéphane Schück5, Julien Souvignet1, Agnès Lillo-Le Louët10, Nathalie Texier5.
Abstract
BACKGROUND: Adverse drug reactions (ADRs) are an important cause of morbidity and mortality. Classical Pharmacovigilance process is limited by underreporting which justifies the current interest in new knowledge sources such as social media. The Adverse Drug Reactions from Patient Reports in Social Media (ADR-PRISM) project aims to extract ADRs reported by patients in these media. We identified 5 major challenges to overcome to operationalize the analysis of patient posts: (1) variable quality of information on social media, (2) guarantee of data privacy, (3) response to pharmacovigilance expert expectations, (4) identification of relevant information within Web pages, and (5) robust and evolutive architecture.Entities:
Keywords: big data; medical terminology; natural language processing; pharmacovigilance; social media
Year: 2017 PMID: 28935617 PMCID: PMC5629348 DOI: 10.2196/resprot.6463
Source DB: PubMed Journal: JMIR Res Protoc ISSN: 1929-0748
Major challenges related to exploiting patient posts in pharmacovigilance.
| Question | Challenge | Response? Process? Solution? |
| How to manage social media quality? | Variable quality of information on social media | Identify social media that present high-quality content regarding relevance and completeness of information for pharmacovigilance purposes |
| How to manage data privacy? | Guarantee of data privacy | Take into account data privacy of patients posting on the Internet |
| How to deal with pharmacovigilance main objectives? | Response to pharmacovigilance expert expectations | Identify the optimal framework where analysis of patient posts can usefully complement usual pharmacovigilance processes |
| How to identify relevant information within patient posts? | Identification and processing of relevant information (eg, drugs and adverse reactions) within Web pages | Extract, process, and render relevant information on drugs and their adverse reactions |
| How to manage scalability related to big data collection in social media? | Robust and evolutive architecture | Take into account the evolution of the platform and the high quantity of data available on the Internet that requires specific methods for big data collection and storage |
Proposed methods for meeting the 5 major challenges.
| Challenge | Proposed method |
| Variable quality of information on social media | Design a scoring method that allows selection of high-quality social media |
| Guarantee of data privacy | Design a technical solution based on data minimization and access restriction that guarantees data privacy of patient posting on social media |
| Response to pharmacovigilance expert expectations | Study the pharmacovigilance expert requirements and formalize them in use case diagrams and usage scenarios |
| Identification and processing of relevant information (eg, drugs and adverse drug reactions) within Web pages | Enforce best practices based on specialized dictionaries, pattern-based matching, and natural language processing to detect drugs and their adverse reactions in patient posts |
| Robust and evolutive architecture | Build a component-based architecture that allows storage of big data and accessibility to third-party applications through Web services |
Definitions of use case diagram and usage scenario.
| Definition | Example | ||
| Unified Modeling Language provides use case diagrams in order to specify and describe use cases. These are cases of the system (ie, a description of how the user interacts with the system). Use cases are generally organized in steps, which can be considered smaller usage components (and therefore can be described as discrete use cases). | An example of use case is the selection of a drug and/or an adverse reaction upon interaction with a form displayed by the system. Another use case is to read posts from social media for this selection. | ||
| Usage scenarios are circumstances in which a user will interact with the system. Multiple usage scenarios can be defined that can correspond to a single use case. | Making a pharmacovigilance survey is a usage scenario. Indeed, we can decompose the survey into smaller components to look for similar case reports in the national pharmacovigilance database, make a literature search, and perform a search in social media. | ||
Medical terminologies selected for feeding the dictionary for the data extraction task.
| Short name | Long name | Content | Language | Number of concepts | Example | Reference |
| ATC | Anatomical Therapeutic Chemical Classification | Drugs classes and substances | FR/EN | 4717 | http://www.chu-rouen.fr/cismef/skos#ATC_CD_A/ anabolic agents for systemic use/ anabolic steroids/androstan derivatives/ androstanolone | [ |
| MedDRA | Medical Dictionary for Regulatory Activities | Adverse drug reactions | FR/EN | 74,413 | MedDRA top tree/cardiac disorders/heart failures/cardiac failure/cardiac insufficiency | [ |
| Racine Pharma | Roots of the pharmaceutical products | Short names of drugs | FR | 5164 | Ascorbic acid | manually produced |
| Vidal Drug Database | Drug database sold by the Vidal company | All the regulatory information about drugs | FR | 40,227 | XVII congenital malformations and chromosomal abnormalities/ Q00-Q07 congenital malformations of the nervous system/ Q00.0 anencephaly | [ |
Figure 1Use cases of the project.
Examples of drugs and adverse reactions extracted by the annotation.
| Drug effects | Medical entity | Example |
| Present (ie, | Drug: citalopram (Racine Pharma); | “ |
| Absent | Drug: androcur (Racine Pharma); | “My taking of |
aMedDRA: Medical Dictionary for Regulatory Activities.
Figure 2The technical architecture of the Adverse Drug Reactions from Patient Reports in Social Media project.