| Literature DB >> 35052263 |
Nikolaos Malamas1,2, Konstantinos Papangelou1, Andreas L Symeonidis2.
Abstract
Virtual assistants are becoming popular in a variety of domains, responsible for automating repetitive tasks or allowing users to seamlessly access useful information. With the advances in Machine Learning and Natural Language Processing, there has been an increasing interest in applying such assistants in new areas and with new capabilities. In particular, their application in e-healthcare is becoming attractive and is driven by the need to access medically-related knowledge, as well as providing first-level assistance in an efficient manner. In such types of virtual assistants, localization is of utmost importance, since the general population (especially the aging population) is not familiar with the needed "healthcare vocabulary" to communicate facts properly; and state-of-practice proves relatively poor in performance when it comes to specialized virtual assistants for less frequently spoken languages. In this context, we present a Greek ML-based virtual assistant specifically designed to address some commonly occurring tasks in the healthcare domain, such as doctor's appointments or distress (panic situations) management. We build on top of an existing open-source framework, discuss the necessary modifications needed to address the language-specific characteristics and evaluate various combinations of word embeddings and machine learning models to enhance the assistant's behaviour. Results show that we are able to build an efficient Greek-speaking virtual assistant to support e-healthcare, while the NLP pipeline proposed can be applied in other (less frequently spoken) languages, without loss of generality.Entities:
Keywords: Rasa; chatbot; ehealhtcare; virtual assistant
Year: 2022 PMID: 35052263 PMCID: PMC8775452 DOI: 10.3390/healthcare10010099
Source DB: PubMed Journal: Healthcare (Basel) ISSN: 2227-9032
Summary of presented virtual assistants.
| Virtual Assistant | Use Cases | Interface |
|---|---|---|
| Ada [ | Health monitoring | Mobile text application |
| Youper [ | Mental health assessment | Mobile text application |
| Fitzpatrick et al. [ | Depression therapy | Mobile text application |
| Bickmore et al. [ | Virtual coach | Tablet touch screens |
| Ferguson et al. [ | Health condition tracker | Voice interface |
Figure 1Examples of supported scenarios. The green boxes represent user intents and the red the assistant’s responses or actions.
Supported intents.
| Intent | Number of Examples |
|---|---|
| Request appointment | 151 |
| Change appointment | 121 |
| Cancel appointment | 115 |
| Inform | 79 |
| Information for appointment | 75 |
| Information for drugs | 55 |
| Affirm | 30 |
| Ask functions | 25 |
| Problem | 22 |
| Confirm suggestion | 21 |
| Deny | 21 |
| Greet | 17 |
| I am fine | 17 |
| Stop the procedure | 11 |
| Thanks | 9 |
Figure 2Description of the pipeline designed. Dashed boxes indicate that when dense features were used we chose one of the two options.
Description of used configurations for the task of intent recognition.
| Configuration | Description |
|---|---|
| 1 | DIET with sparse + BERT features |
| 2 | DIET with sparse features |
| 3 | DIET with BERT features |
| 4 | polynomial SVM with BERT features |
| 5 | polynomial SVM with FastText features |
| 6 | DIET with sparse + FastText features |
| 7 | RBF SVM with FastText features |
| 8 | DIET (modified) with sparse + BERT features |
Average and standard deviation (in the parenthesis) for each metric at the task of intent classification. For the precision and recall the first line corresponds to weighted and the second to unweighted average.
| Configuration | Accuracy | Precision | Recall |
|---|---|---|---|
| 1 | 0.883 (0.025) | 0.883 (0.025) | 0.883 (0.025) |
| 0.806 (0.048) | 0.733 (0.052) | ||
| 2 | 0.894 (0.027) | 0.899 (0.028) | 0.894 (0.027) |
| 0.819 (0.053) | 0.777 (0.055) | ||
| 3 | 0.749 (0.035) | 0.760 (0.034) | 0.749 (0.035) |
| 0.720 (0.056) | 0.639 (0.058) | ||
| 4 | 0.877 (0.018) | 0.887 (0.017) | 0.877 (0.018) |
| 0.854 (0.039) | 0.787 (0.040) | ||
| 5 | 0.889 (0.021) | 0.899 (0.021) | 0.889 (0.021) |
| 0.843 (0.051) | 0.793 (0.055) | ||
| 6 | 0.901 (0.022) | 0.906 (0.022) | 0.901 (0.022) |
| 0.837 (0.044) | 0.788 (0.044) | ||
| 7 | 0.899 (0.021) | 0.907 (0.020) | 0.899 (0.021) |
| 0.866 (0.037) | 0.811 (0.040) | ||
| 8 | 0.900 (0.023) | 0.912 (0.021) | 0.900 (0.023) |
| 0.844 (0.044) | 0.780 (0.052) |
Average precision and recall for each intent using configuration 7.
| Intent | Precision | Recall | |
|---|---|---|---|
| 1 | Information for drugs | 0.990 (0.034) | 0.945 (0.062) |
| 2 | Change appointment | 0.965 (0.037) | 0.966 (0.033) |
| 3 | Information for appointment | 0.938 (0.058) | 0.941 (0.060) |
| 4 | Cancel appointment | 0.968 (0.028) | 0.940 (0.045) |
| 5 | Confirm suggestion | 1.00 (0.000) | 0.898 (0.168) |
| 6 | Request appointment | 0.913 (0.049) | 0.949 (0.037) |
| 7 | Inform | 0.857 (0.064) | 0.949 (0.058) |
| 8 | Ask Functions | 0.785 (0.146) | 0.744 (0.165) |
| 9 | Thanks | 0.960 (0.195) | 0.840 (0.272) |
| 10 | Greet | 0.866 (0.223) | 0.646 (0.269) |
| 11 | Problem | 0.677 (0.181) | 0.755 (0.186) |
| 12 | Affirm | 0.727 (0.164) | 0.793 (0.158) |
| 13 | Deny | 0.846 (0.183) | 0.574 (0.286) |
| 14 | I am fine | 0.624 (0.285) | 0.590 (0.274) |
| 15 | Stop the procedure | 0.884 (0.227) | 0.633 (0.294) |
Classification results for the entity “doctor”.
| Configuration | Precision | Recall |
|---|---|---|
| 1 | 0.995 (0.013) | 0.911 (0.059) |
| 2 | 0.960 (0.042) | 0.829 (0.061) |
| 3 | 0.994 (0.013) | 0.914 (0.080) |
| 4 | 0.994 (0.014) | 0.906 (0.061) |
| 5 | 0.995 (0.015) | 0.912 (0.061) |
| 6 | 1.00 (0.000) | 0.909 (0.067) |
| 7 | 0.998 (0.008) | 0.914 (0.071) |
| 8 | 0.983 (0.025) | 0.936 (0.059) |
Preliminary results for the intent prediction problem in the test data.
| Intent | Recall | |
|---|---|---|
| 1 | Request appointment | 100% (11/11) |
| 2 | Change appointment | 100% (9/9) |
| 3 | Cancel appointment | 100% (3/3) |
| 4 | Information for appointment | 50% (5/10) |
| 5 | Information for drugs | 87.5% (7/8) |
| 6 | Problem | 50% (3/6) |