| Literature DB >> 32293428 |
Xiangwen Liu1,2, Joe Meehan1, Weida Tong1, Leihong Wu3, Xiaowei Xu4, Joshua Xu5.
Abstract
BACKGROUND: Drug label, or packaging insert play a significant role in all the operations from production through drug distribution channels to the end consumer. Image of the label also called Display Panel or label could be used to identify illegal, illicit, unapproved and potentially dangerous drugs. Due to the time-consuming process and high labor cost of investigation, an artificial intelligence-based deep learning model is necessary for fast and accurate identification of the drugs.Entities:
Keywords: Daily-med; Deep learning; Drug labeling; Image recognition; Neural network; Opioid drug; Pharmaceutical packaging; Scene text detection; Semantic similarity; Similarity identification
Mesh:
Substances:
Year: 2020 PMID: 32293428 PMCID: PMC7158001 DOI: 10.1186/s12911-020-1078-3
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 2(a) The proposed drug label identification approach. (b) Architecture of the Connectionist Text Proposal Network (CTPN). (c) Simplified Architecture of Tesseract OCR
Distribution of images in drug label
| DRUGS | Number of image samples per drug label | Number of unique labels | Total images |
|---|---|---|---|
| Opioid Drugs | 2 | 196 | 392 |
| 3 | 148 | 444 | |
| 4 | 80 | 320 | |
| 5 | 42 | 210 | |
| 6 | 25 | 150 | |
| 7 | 19 | 133 | |
| 8 | 9 | 72 | |
| 9 | 2 | 18 | |
| 10 | 3 | 30 | |
| Non-opioid Drugs | 5 | 473 | 2365 |
Fig. 1(a) Sample result of text detection by CTPN. Some texts detected by CTPN may be noises for later drug product recognition. (b) One-word embedding example “codeine”. As shown, the corresponding vector of “codeine” is closer to “oxycodone” and “hydrocodone”, while vector of “codeine” is further from “glyburide” and “orlistat”, based on the functionality difference of drugs
Retrieval results on mixed images of Opioid and Non-Opioid drug label
| P @k | R @k | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Methods | k = 1 | k = 2 | k = 3 | k = 4 | k = 5 | k = 6 | k = 1 | k = 2 | k = 3 | k = 4 | k = 5 | k = 6 |
| Image-based | 0.630 | 0.535 | 0.470 | 0.413 | 0.346 | 0.297 | 0.152 | 0.258 | 0.340 | 0.398 | 0.417 | 0.429 |
| Levenshtein with text | 0.480 | 0.550 | 0.480 | 0.425 | 0.376 | 0.333 | 0.116 | 0.265 | 0.347 | 0.410 | 0.453 | 0.482 |
| Embedding of text | 0.800 | 0.720 | 0.640 | 0.565 | 0.478 | 0.405 | 0.193 | 0.347 | 0.463 | 0.545 | 0.576 | 0.586 |
| 0.5 * Image + 0.5 * Text embedding † | 0.800 | 0.725 | 0.640 | 0.570 | 0.480 | 0.410 | 0.193 | 0.349 | 0.463 | 0.549 | 0.578 | 0.593 |
| Improvement* | 27% | 32% | 33% | 34% | 28% | 23% | 27% | 32% | 33% | 34% | 28% | 23% |
Retrieval results on images of Opioid drug label
| P @k | R @k | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Methods | k = 1 | k = 2 | k = 3 | k = 4 | k = 5 | k = 6 | k = 1 | k = 2 | k = 3 | k = 4 | k = 5 | k = 6 |
| Image-based | 0.650 | 0.540 | 0.453 | 0.395 | 0.342 | 0.302 | 0.193 | 0.320 | 0.404 | 0.469 | 0.507 | 0.537 |
| Levenshtein with text | 0.580 | 0.495 | 0.407 | 0.340 | 0.298 | 0.263 | 0.172 | 0.294 | 0.362 | 0.404 | 0.442 | 0.469 |
| Embedding of text | 0.800 | 0.665 | 0.560 | 0.495 | 0.436 | 0.388 | 0.237 | 0.395 | 0.499 | 0.588 | 0.647 | 0.691 |
| 0.5 * Image + 0.5 * Text embedding † | 0.88 | 0.755 | 0.633 | 0.568 | 0.510 | 0.460 | 0.261 | 0.448 | 0.564 | 0.674 | 0.757 | 0.819 |
| Improvement* | 35% | 40% | 40% | 44% | 49% | 52% | 35% | 40% | 40% | 44% | 49% | 52% |
Fig. 3Similarity analysis among three opioid drug labeling. Each drug contains three distinct label images. Similarity scores range from 0 to 1 (most similar)
Fig. 4Advantages of our approach for regular image-based similarity analysis. (a) Example of different image resolutions for the same drug label. (b) Example of different images of the same drug label with stable text