| Literature DB >> 34939028 |
Yue Wu1, Zhichao Liu1, Leihong Wu1, Minjun Chen1, Weida Tong1.
Abstract
Background & Aims: The United States Food and Drug Administration (FDA) regulates a broad range of consumer products, which account for about 25% of the United States market. The FDA regulatory activities often involve producing and reading of a large number of documents, which is time consuming and labor intensive. To support regulatory science at FDA, we evaluated artificial intelligence (AI)-based natural language processing (NLP) of regulatory documents for text classification and compared deep learning-based models with a conventional keywords-based model.Entities:
Keywords: BERT; European medicines agency; United States Food and Drug Administration; drug induced liver injury; drug labeling; named entity recognition; natural language processing; regulatory science
Year: 2021 PMID: 34939028 PMCID: PMC8685544 DOI: 10.3389/frai.2021.729834
Source DB: PubMed Journal: Front Artif Intell ISSN: 2624-8212
FIGURE 1Quorum flowchart describes the study design. (A) Drug labeling document classification models developed and compared in this study. (B) The study design of model training and evaluation using FDA labeling documents and model validation using EMA labeling documents.
FIGURE 2Workflow for the training of sentence classification module and the development of final document classification model.
Sentence count with or without pre-defined liver-related context.
| Without pre-defined context | In context of liver (string-filter) | In context of liver (BERT for NER) | ||||
|---|---|---|---|---|---|---|
| FDA | EMA | FDA | EMA | FDA | EMA | |
| DILI positive sentences | 540 | 232 | 540 | 232 | 540 | 232 |
| DILI negative sentences | 28,712 | 14,915 | 961 | 764 | 1,313 | 927 |
FIGURE 3Evaluation and validation of the BERT NER models for context classification. (A) Confusion matrix obtained from evaluation of the BERT-based context classification module using the FDA test documents. (B) Confusion matrix obtained from evaluation of the BERT-based context classification module using the EMA validation documents.
Model evaluation and validation using cross-agency data.
| Model evaluation using FDA test documents | |||
| Document classification models | Matthews correlation coefficient | Recall | Precision |
| Deep learning-based model | 0.84 | 1.00 | 0.78 |
| Hybrid deep learning-based model | 0.87 | 1.00 | 0.82 |
| Keywords-based model | 0.60 | 0.90 | 0.58 |
| Model validation using cross-agency data (EMA test documents) | |||
| Document classification models | Matthews correlation coefficient | Recall | Precision |
| Deep learning-based model | 0.79 | 1.00 | 0.71 |
| Hybrid deep learning-based model | 0.84 | 1.00 | 0.77 |
| Keywords-based model | 0.61 | 0.96 | 0.55 |
FIGURE 4Evaluation and validation of the document classification models. (A) Confusion matrix obtained from evaluation of the AI model using FDA test documents. (B) Confusion matrix obtained from evaluation of the hybrid deep learning-based model using FDA test documents. (C) Confusion matrix obtained from evaluation of the keywords-based model using FDA test documents. (D) Confusion matrix obtained from evaluation of the AI model using EMA validation documents. (E) Confusion matrix obtained from evaluation of the hybrid deep learning-based model using EMA validation documents. (F) Confusion matrix obtained from evaluation of the keywords-based model using EMA validation documents.
FIGURE 5Representative sentences showing contributions of word tokens to model predictions. (A) DILI-positive sentence due to fatal hepatic failure. (B) DILI-positive sentence due to hepatitis/hepatic failure. (C) DILI-negative sentence that provides indication information.