| Literature DB >> 31066711 |
Armelle Arnoux-Guenegou1, Yannick Girardeau1,2, Xiaoyi Chen1, Myrtille Deldossi3, Rim Aboukhamis4, Carole Faviez5, Badisse Dahamna6, Pierre Karapetiantz1, Sylvie Guillemin-Lanne3, Agnès Lillo-Le Louët4, Nathalie Texier5, Anita Burgun1,2,7, Sandrine Katsahian1,7,8,9.
Abstract
BACKGROUND: Social media is a potential source of information on postmarketing drug safety surveillance that still remains unexploited nowadays. Information technology solutions aiming at extracting adverse reactions (ADRs) from posts on health forums require a rigorous evaluation methodology if their results are to be used to make decisions. First, a gold standard, consisting of manual annotations of the ADR by human experts from the corpus extracted from social media, must be implemented and its quality must be assessed. Second, as for clinical research protocols, the sample size must rely on statistical arguments. Finally, the extraction methods must target the relation between the drug and the disease (which might be either treated or caused by the drug) rather than simple co-occurrences in the posts.Entities:
Keywords: MedDRA; Racine Pharma; data mining; drug-related side effects and adverse reactions; natural language processing; social media
Year: 2019 PMID: 31066711 PMCID: PMC6528435 DOI: 10.2196/11448
Source DB: PubMed Journal: JMIR Res Protoc ISSN: 1929-0748
Figure 1Study overview. ADR: adverse drug reaction.
Figure 2Adverse drug reaction pipeline. ADR: adverse drug reaction; ATC: Anatomical Therapeutic Chemical (in ATC classification); MedDRA: Medical Dictionary for Regulatory Activities.
Figure 3Phase 2 messages selection flowchart. ADR: adverse drug reaction; ATC: Anatomical Therapeutic Chemical (in ATC classification); MedDRA: Medical Dictionary for Regulatory Activities.
List of drug and adverse event concepts selected as pharmacovigilance use cases.
| Active drug ingredient: French names (date of marketing in cases where drug has been withdrawn) | Adverse events of interest | Corresponding Medical Dictionary for Regulatory Activities term | Media coverage or alert date |
| Gardasil | Autoimmune disease, complex regional pain syndrome, and postural orthostatic tachycardia syndrome | Autoimmune disorders (HLGTa) + complex regional pain syndrome (PTb) + postural orthostatic tachycardia syndrome (PT) | 2013 |
| Meningitec | Quality defect and any adverse event | All terms | 2015 |
| Methylphénidate: Ritaline, Concerta, Medikinet, Quasym, and Ritaline | Neurological and cardiac effects | Cardiac disorders (SOCc) + nervous system disorders (SOC) | 2009 (recommendation) |
| Acétate de cyprotérone: Diane 35, Cypropharm (2010-2013), Elleacnelle (2010-2013), Evepar, Holgyme (2002-2015), Lumalia (2003-2013), Minerva, and Arielle (1997-2011) | Thromboembolic risk | Embolism and thrombosis (HLGT) | 2012 (alert) |
| Fluoxétine: Prozac, Elindra (2001-2008), Fluctine (1998-2008), and Fontex (2001-2008) | All adverse events | All terms | Mostly 2013 |
| Méthadone | Prolonged time from the start of the Q wave to the end of the T wave during electrocardiogram (approximates the time taken from when the cardiac ventricles start to contract to when they finish relaxing) | QT interval prolongation | 2007 |
| Sofosbuvir: Harvoni, Epclusa, and Sovaldi | Bradycardia | Bradycardia (PT) | 2013 |
| Codeine: Codenfan, Codoliporane, Migralgine, and Néocodion et Prontalgine | Respiratory disorders | Respiratory disorders (HLGT) | 2012 re-evaluation |
| Hydroxyzine: atarax | Rhythm disorders | Cardiac arrhythmias (HLGT) | 2014 |
| Nicorandil: Adancor and Ikorel | Skin ulceration | Skin ulcer (HLTd) | 2012 |
| Midodrine: Gutron | Hypertension | Blood pressure increased (PT) | 2013 |
| Galantamine: Reminyl, Galanthen (2012-2015), and Galema (2011-2015) | Skin reaction | Skin and subcutaneous tissue disorders (SOC) | 2014 |
| Crizotinib: Xalkori | Heart failure | Heart failures HLGT | 2014 |
| Valproate de sodium: Depakine, Imaslav, and Micropakine | Teratogenic effects | Congenital, familial, and genetic disorders (SOC) | 2012? |
| Isotrétinoine: Curacné, Acnetrait, Contracné, Procuta, and Roaccutane | Teratogenic effects + psychiatric disorders | Congenital, familial, and genetic disorders (SOC) and psychiatric disorders (SOC) | 2002 |
| Fingolimod: Gilenya | Leukoencephalopathy | Toxic leukoencephalopathy (PT) | 2014 |
| Aripiprazole: Abilify | Suicidal behavior | Suicidal and self-injurious behavior (HLGT) | 2013 (words of warning: 2016) |
aHLGT: high-level group term.
bPT: preferred term.
cSOC: system organ class.
dHLT: high-level term.
Figure 4Interface of the manual annotation platform. ID: identifier; MedDRA: Medical Dictionary for Regulatory Activities.
Figure 5Recall, precision, and f-measure definitions. ADR: adverse drug reaction; ADR-PRISM: Adverse Drug Reaction from Patient Reports in Social Media; FN: false negative, manually annotated ADR by the gold standard that is not extracted by the ADR information extraction tool; FP: false positive, positive explicit relationship extracted by the ADR information extraction tool that is not manually annotated by gold standard; TN: true negative; TP: true positive.
Sample size required according to the expected precision and recall and total width of confidence intervals (CIs).
| Range of CI | Expected precision or recall values | ||||||||
| 0.1 | 0.2 | 0.3 | 0.4 | 0.5 | 0.6 | 0.7 | 0.8 | 0.9 | |
| 90% CI | 35 | 61 | 81 | 92 | 96 | 92 | 81 | 61 | 35 |
| 95% CI | 138 | 246 | 323 | 369 | 384 | 369 | 323 | 246 | 138 |
Common features and differences between phase 1 and phase 2.
| Issues | Phase 1 | Phase 2 |
| Evaluation focus | Drugs and adverse events entity recognition | Adverse drug reaction relationships recognition |
| Gold standard (manual annotations) | Not blind | Blind |
| Gold standard annotators | Pharmacovigilance experts | Experts in medical terminologies |
| Interannotator agreement | No | Yes |
| Granularity issue | No | Yes |
| Sentence boundary issue | No | Yes |