| Literature DB >> 32050435 |
Francesc López Seguí1,2, Ricardo Ander Egg Aguilar3, Gabriel de Maeztu4, Anna García-Altés5, Francesc García Cuyàs6, Sandra Walsh7, Marta Sagarra Castro8, Josep Vidal-Alaball9,10.
Abstract
Background: The primary care service in Catalonia has operated an asynchronous teleconsulting service between GPs and patients since 2015 (eConsulta), which has generated some 500,000 messages. New developments in big data analysis tools, particularly those involving natural language, can be used to accurately and systematically evaluate the impact of the service. Objective: The study was intended to assess the predictive potential of eConsulta messages through different combinations of vector representation of text and machine learning algorithms and to evaluate their performance. Methodology: Twenty machine learning algorithms (based on five types of algorithms and four text representation techniques) were trained using a sample of 3559 messages (169,102 words) corresponding to 2268 teleconsultations (1.57 messages per teleconsultation) in order to predict the three variables of interest (avoiding the need for a face-to-face visit, increased demand and type of use of the teleconsultation). The performance of the various combinations was measured in terms of precision, sensitivity, F-value and the ROC curve.Entities:
Keywords: classification; machine learning; primary care; remote consultation; teleconsultation
Year: 2020 PMID: 32050435 PMCID: PMC7036927 DOI: 10.3390/ijerph17031093
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Data recorded by the teleconsulting system.
| Conversation Title | Conversation ID | Message ID | From | To | Message | Files Attached? |
|---|---|---|---|---|---|---|
| Travelling to Australia | C1 | M1 | Mr. John Patient | Ms. Jane Doctor | Dear doctor, I’m travelling to Australia on 15 December. Do I need to have any vaccinations? Many thanks | No |
| M2 | Ms. Jane Doctor | Mr. John Patient | Hi, vaccination is required for travel to Australia | No |
Annotation by the GP.
| Conversation ID | Face-to-Face Visit Avoided? | Increased Demand? | Type of Visit |
|---|---|---|---|
| C1 | Yes | No | 6 (Vaccinations) |
Text representations and algorithms used.
| Text Representations | Algorithms |
|---|---|
| BoW | Random Forest |
| TF–IDF | Gradient Boosting (lightGBM) |
| Word2Vec | Fasttext |
| Doc2Vec | Multinomial Naive Bayes |
| Complement Naive Bayes |
Figure 1Performance metrics of algorithms.
Results of the best algorithm/text representation combination, according to the variable to be approximated. Average (SD) of the 10 iterations.
| Variable | Precision | Recall | F1 | Roc_AUC |
|---|---|---|---|---|
| Avoiding the need of a face-to-face visit | Random Forest | FastText | FastText | ComplementNB |
| Increased demand | Random Forest | FastText | FastText | FastText |
| Type of use of the teleconsultation | MultinomialNB | MultinomialNB | MultinomialNB |