| Literature DB >> 36050787 |
Graciela Gonzalez-Hernandez1, Martin Krallinger2, Monica Muñoz3, Raul Rodriguez-Esteban4, Özlem Uzuner5, Lynette Hirschman6.
Abstract
Monitoring drug safety is a central concern throughout the drug life cycle. Information about toxicity and adverse events is generated at every stage of this life cycle, and stakeholders have a strong interest in applying text mining and artificial intelligence (AI) methods to manage the ever-increasing volume of this information. Recognizing the importance of these applications and the role of challenge evaluations to drive progress in text mining, the organizers of BioCreative VII (Critical Assessment of Information Extraction in Biology) convened a panel of experts to explore 'Challenges in Mining Drug Adverse Reactions'. This article is an outgrowth of the panel; each panelist has highlighted specific text mining application(s), based on their research and their experiences in organizing text mining challenge evaluations. While these highlighted applications only sample the complexity of this problem space, they reveal both opportunities and challenges for text mining to aid in the complex process of drug discovery, testing, marketing and post-market surveillance. Stakeholders are eager to embrace natural language processing and AI tools to help in this process, provided that these tools can be demonstrated to add value to stakeholder workflows. This creates an opportunity for the BioCreative community to work in partnership with regulatory agencies, pharma and the text mining community to identify next steps for future challenge evaluations.Entities:
Mesh:
Year: 2022 PMID: 36050787 PMCID: PMC9436770 DOI: 10.1093/database/baac071
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 4.462
Figure 1.Schematic of the drug discovery and development process.
Figure 2.Stakeholders and information sources in the drug development process.
Figure 3.Examples of entities and relations from toxicology text mining.
Table of sample applications and insertion points for text mining in drug discovery and PV
| Application | Input source | Stakeholder | Drug discovery cycle | Challenges for NLP | Access barriers | Evaluations or resource |
|---|---|---|---|---|---|---|
| Drug mechanism | Literature | Academic and pharma | Discovery and repurposing | Drug interaction w gene/protein | Literature paywalls and competitive intelligence | DrugProt (BioCreative VII) |
| Toxicity studies | Literature and toxicology reports | Pharma | Pre-clinical | Chemical-organ interactions; vocabulary for animal studies | Literature paywalls and competitive intelligence | eTOX |
| Clinical trials | Clinical trial data and EHR | Pharma and regulatory agencies | Clinical trial | Finding relevant occurrences in EHR; distinguishing AEs from other medical conditions | Privacy and local language access | n2c2, MADE 1.0 |
| Marketing intelligence | Patent filings | Pharma | Marketing | Drug interaction w gene/protein | Multilingual | CHEMDNER Patents BioCreative V (2015) |
| PV | Spontaneous reports | Pharma, regulatory agencies, clinicians and consumers | Clinical trials and post-market surveillance | Completeness of reports, duplicate reports and timeline of drug administration and adverse reaction | Privacy and local language access | |
| PV | Drug product inserts | Regulatory agencies and consumers | Post-market surveillance | Distinguishing AEs from other medical conditions | XML labels available @ DailyMed | NIST TAC (2017) ADR; ADE Eval |
| PV | Literature (case reports) | Pharma, regulatory agencies and clinicians | Clinical trials and post-market surveillance | Handling article full text; finding relevant case studies; temporal and causal relation between drug administration and AE | Literature paywalls | |
| PV | SM | Regulatory agencies, clinicians and consumers | Post-market surveillance | Finding relevant tweets; use of informal language | Identifying useful samples; access to feeds | SMM4H (2017, 2018, 2019, 2021) |