| Literature DB >> 30140224 |
Elise Bigeard1,2,3, Natalia Grabar1, Frantz Thiessard1,3,4.
Abstract
Drug misuse may happen when patients do not follow the prescriptions and do actions which lead to potentially harmful situations, such as intakes of incorrect dosage (overuse or underuse) or drug use for indications different from those prescribed. Although such situations are dangerous, patients usually do not report the misuse of drugs to their physicians. Hence, other sources of information are necessary for studying these issues. We assume that online health fora can provide such information and propose to exploit them. The general purpose of our work is the automatic detection and classification of drug misuses by analysing user-generated data in French social media. To this end, we propose a multi-step method, the main steps of which are: (1) indexing of messages with extended vocabulary adapted to social media writing; (2) creation of typology of drug misuses; and (3) automatic classification of messages according to whether they contain drug misuses or not. We present the results obtained at different steps and discuss them. The proposed method permit to detect the misuses with up to 0.773 F-measure.Entities:
Keywords: France; drug misuse; natural language processing; patient safety; social media
Year: 2018 PMID: 30140224 PMCID: PMC6094963 DOI: 10.3389/fphar.2018.00791
Source DB: PubMed Journal: Front Pharmacol ISSN: 1663-9812 Impact factor: 5.810
Figure 1Schema of the steps of the methods.
Figure 2Schema of experiments performed for the detection of messages with drug misuses.
Desciption of the generated semantic resources: their size and examples.
| 29 | |||
| 1,206 | |||
| 83 | |||
| 20,514 | |||
| 69 | |||
| 353 | |||
| 180 | |||
| 298 |
Evaluation of indexing on test corpus (400 messages), done at message and sentence levels.
| 297 | 0.868 | 0.505 | 0.639 | 425 | 0.779 | 0.437 | 0.560 | |
| 388 | 0.660 | 577 | ||||||
| 299 | 0.869 | 0.509 | 0.642 | 430 | 0.780 | 0.442 | 0.565 | |
| 339 | 0.867 | 0.577 | 0.693 | 486 | 0.778 | 0.500 | 0.609 | |
| 416 | 0.268 | 0.389 | 469 | 0.113 | 0.483 | 0.184 | ||
| 312 | 0.558 | 0.531 | 0.544 | 436 | 0.482 | 0.449 | 0.465 | |
| 334 | 0.536 | 0.568 | 0.552 | 457 | 0.481 | 0.470 | 0.475 | |
| 338 | 0.539 | 0.575 | 0.557 | 444 | 0.450 | 0.457 | 0.453 | |
| 431 | 0.617 | 0.734 | 0.670 | 618 | 0.504 | 0.636 | 0.563 | |
| 506 | 0.291 | 0.862 | 0.435 | 696 | 0.156 | 0.716 | 0.256 | |
The best result for each metric is marked up with a bold font.
Figure 3Typology of drug misuses.
Figure 4Binary experience Misuse/Rest with the Drugs set of features and different algorithms.
Figure 5Combined experience No use/Rest followed by Use/Misuse with the Drugs set of features and different algorithms.
Figure 6Impact of drug names on automatic categorization results of misuses, in terms of F-measure: misuse/rest experiment, Text features and NaiveBayes algorithm.
Figure 7Impact of disorder names on automatic categorization results of misuses, in terms of F-measure: misuse/rest experiment, Text features and NaiveBayes algorithm.