| Literature DB >> 27066806 |
Harry Hochheiser1, Yifan Ning, Andres Hernandez, John R Horn, Rebecca Jacobson, Richard D Boyce.
Abstract
BACKGROUND: Because vital details of potential pharmacokinetic drug-drug interactions are often described in free-text structured product labels, manual curation is a necessary but expensive step in the development of electronic drug-drug interaction information resources. The use of nonexperts to annotate potential drug-drug interaction (PDDI) mentions in drug product label annotation may be a means of lessening the burden of manual curation.Entities:
Keywords: crowdsourcing; drug interactions, drug product labeling, structured product labels; natural language processing
Year: 2016 PMID: 27066806 PMCID: PMC4844909 DOI: 10.2196/resprot.5028
Source DB: PubMed Journal: JMIR Res Protoc ISSN: 1929-0748
Figure 1Data model used in this study for PDDIs mentioned within drug product labels.
Figure 2Pipeline for extraction of pharmacokinetic PDDIs from drug labels sections.
Figure 3Screenshots of the DOMEO PDDI annotation plugin: (a) product label excerpt with text selected by an annotator as being relevant to a PDDI and (b) form with the fields that the annotator must complete in order to describe the PDDI using the data model described in Figure 1.
Figure 4Annotator and NLP performance (F1 scores) for each of the four scenarios and overall performance across all four scenarios.
F1 measures for all participants and NLP system across all scenarios and overall.
| Annotator | Scenario | Scenario | Scenario | Scenario | Overall |
| Expert | 0.80 | 0.79 | 0.54 | 0.66 | 0.68 |
| Nonexpert 1 | 0.79 | 0.83 | 0.59 | 0.53 | 0.66 |
| Nonexpert 2 | 0.76 | 0.68 | 0.57 | 0.70 | 0.67 |
| Nonexpert 3 | 0.74 | 0.62 | 0.53 | 0.62 | 0.61 |
| NLP | 0.58 | 0.40 | 0.41 | 0.46 | 0.46 |
aNo assistance.
bPreannotation of drug mentions.
cPreannotation of drug mentions and PDDIs.
dNo assistance.
Participant self-reported task completion times.
| Participant | Scenario | <1 hour | 1-3 hours | 3-5 hours | >5 hours |
| Expert |
|
|
|
|
|
|
| 1 | X |
|
|
|
|
| 2 | X |
|
|
|
|
| 3 | X |
|
|
|
| Nonexpert 1 |
|
|
|
|
|
|
| 1 | X |
|
|
|
|
| 2 | X |
|
|
|
|
| 3 | X |
|
|
|
| Nonexpert 2 |
|
|
|
|
|
|
| 1 |
| X |
|
|
|
| 2 | X |
|
|
|
|
| 3 | X |
|
|
|
| Nonexpert 3 |
|
|
|
|
|
|
| 1 |
|
| X |
|
|
| 2 |
|
| X |
|
|
| 3 |
|
|
| X |
Usability questionnaire results. All results reported on a 5-point scale (1=very difficult to 5=very easy).
| Participant | Scenario 1 | Scenario 2 | Scenario 3 | Mean |
| Expert | 2 | 4 | 2 | 2.67 |
| Nonexpert 1 | 4 | 5 | 2 | 3.67 |
| Nonexpert 2 | 4 | 5 | 2 | 3.67 |
| Nonexpert 3 | 3 | 3 | 2 | 2.67 |
| Mean | 3.25 | 4.25 | 2 | — |
Comparison of agreement between the participants, NLP preannotation, and PDDI annotations (N=151) in the reference standard during the scenario with NER and NLP preannotation assistance (Scenario 3).
| NLP Result | No mention found | Mention | ||
| Participant | No mention | Mentiona | No mentionb | Mention |
|
| NLP FNf
| NLP FN | NLP TPc
| NLP TP |
| Expert | 59 (39.1) | 50 (33.1) | 23 (15.2) | 19 (12.6) |
| Nonexpert 1 | 46 (30.5) | 63 (41.7) | 11 (7.3) | 31 (20.5) |
| Nonexpert 2 | 43 (28.5) | 66 (43.7) | 11 (7.3) | 31 (20.5) |
| Nonexpert 3 | 49 (32.5) | 60 (39.7) | 13 (8.6) | 29 (19.2) |
aIndicates case where the user corrected an NLP error.
bIndicates cases where the NLP was correct and the user was incorrect.
cFN: false negative
dTP: true positive
Analysis of user and NLP false positives relative to the reference standard for Scenario 3.
| NLP result | No mention | Mention (n=93) | |
| Participant | Mentiona | No mentionb | Mention |
|
| NLP TN | NLP FP | NLP FP |
| Expert | 25 | 93 (100) | 0 (0) |
| Nonexpert 1 | 16 | 88 (94.6) | 5 (5.4) |
| Nonexpert 2 | 37 | 86 (92.5) | 7 (7.5) |
| Nonexpert 3 | 24 | 88 (94.6) | 5 (5.4) |
aIndicates cases where the user identified spans that were not identified by the NLP. bIndicates cases where the NLP identified spans that the participant did not annotate.