| Literature DB >> 27496862 |
Claudia Ehrentraut1, Markus Ekholm2, Hideyuki Tanushi1, Jörg Tiedemann3, Hercules Dalianis1.
Abstract
Hospital-acquired infections pose a significant risk to patient health, while their surveillance is an additional workload for hospital staff. Our overall aim is to build a surveillance system that reliably detects all patient records that potentially include hospital-acquired infections. This is to reduce the burden of having the hospital staff manually check patient records. This study focuses on the application of text classification using support vector machines and gradient tree boosting to the problem. Support vector machines and gradient tree boosting have never been applied to the problem of detecting hospital-acquired infections in Swedish patient records, and according to our experiments, they lead to encouraging results. The best result is yielded by gradient tree boosting, at 93.7 percent recall, 79.7 percent precision and 85.7 percent F1 score when using stemming. We can show that simple preprocessing techniques and parameter tuning can lead to high recall (which we aim for in screening patient records) with appropriate precision for this task.Entities:
Keywords: clinical decision-making; databases and data mining; ehealth; electronic health records; secondary care
Mesh:
Year: 2016 PMID: 27496862 PMCID: PMC5802538 DOI: 10.1177/1460458216656471
Source DB: PubMed Journal: Health Informatics J ISSN: 1460-4582 Impact factor: 2.681
Figure 1.A high-level flow chart describing this study’s text-classification approach for automatically detecting HAI. DR stands for daily patient record. In this study, a patient’s DR comprises data from four modules. All DRs of a patient together amount to the patient’s HR.
The characteristics of the HRs used in our study.
| HAI | NoHAI | Total | |
|---|---|---|---|
| Number of HRs | 128 | 85 | 213 |
| Length of hospitalization in days | 2–144 | 3–93 | 2–144 |
| Total number of tokens | 22,528,102 | 2,598,036 | 25,126,138 |
HR: hospitalization record; HAI: hospital-acquired infection.
Different combinations of applied text-classification techniques and feature selection methods as well as the name chosen for each combination.
| Name | Text-classification method | Feature selection method |
|---|---|---|
| TF 1000 | Data not processed | TF 1000 |
| Lemma | Data lemmatized | TF 1000 |
| Stem | Data stemmed | TF 1000 |
| Stop | Stop words removed from data | TF 1000 |
| IST | Data not processed | Infection-specific terms used |
| TF-IDF 1000 | Data not processed | TF-IDF 1000 |
| LS-TFIDF 1000 | Data lemmatized + stop words removed | TF-IDF 1000 |
| SS-TFIDF 1000 | Data stemmed + stop words removed | TF-IDF 1000 |
TF: term frequency; IST: infection-specific terms; TF-IDF: term frequency–inverse document frequency.
Precision, recall and F1 score (in %) for detecting HAIs using GTB, optimized GTB, SVM and optimized SVM given the different preprocessing methods.
| GTB | GTB optimized | SVM | SVM optimized | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| P | R | F1 | P | R | F1 | P | R | F1 | P | R | F1 | |
| TF 1000 | 83.4 | 90.6 | 86.7 | 79.6 | 92.2 | 85.2 | 76.3 | 79.8 | 78.0 | 80.2 | 88.1 | 83.7 |
| Lemma | 79.3 | 87.6 | 83.0 | 76.5 | 92.2 | 83.1 | 60.1 | 100.0 | 75.1 | 78.9 | 88.2 | 83.1 |
| Stem | 82.4 | 88.3 | 85.0 | 79.7 |
| 85.7 | 60.1 | 100.0 | 75.1 | 80.7 | 89.8 | 84.8 |
| Stop | 79.0 | 83.6 | 80.6 | 79.0 | 93.0 | 85.0 | 76.5 | 78.3 | 77.4 | 83.1 | 89.8 | 84.8 |
| IST | 79.0 | 86.0 | 81.7 | 76.7 | 89.1 | 81.9 | 73.0 | 65.1 | 68.9 | 72.9 | 84.5 | 78.0 |
| TF-IDF 1000 | 81.7 | 91.2 | 86.0 | 79.5 | 92.1 | 84.9 | 60.1 | 100.0 | 75.1 | 78.1 | 89.7 | 82.8 |
| LS-TFIDF 1000 | 80.2 | 84.4 | 81.9 | 78.9 | 91.3 | 84.2 | 60.1 | 100.0 | 75.1 | 72.7 | 88.9 | 79.3 |
| SS-TFIDF 1000 | 78.6 | 85.8 | 81.6 | 78.8 | 93.0 | 85.0 | 60.1 | 100.0 | 75.1 | 75.3 | 86.6 | 79.8 |
GTB: gradient tree boosting; SVM: support vector machine; TF: term frequency; IST: infection-specific term; TF-IDF: term frequency–inverse document frequency.
In total, the material comprised 213 HRs of which 128 contained HAI giving a baseline precision of 60 percent, recall of 100 percent and F-score of 75 percent.
Figure 2.Top 20 feature importances for optimized GTB TF1000+stemming trained on the whole dataset. English translation within parenthesis. Note that since stemming is used the english translation is an approximation as directly translating a stem is not always possible.
Figure 3.Top 20 feature importances for un-optimized GTB TF1000+stemming trained on the whole dataset. English translation within parenthesis.
Figure 4.Top 20 feature importances for unoptimized GTB using TF1000 without stemming. English translation within parenthesis.
Classifier errors (optimized GTB-Stem) for the classes HAI and NoHAI, the latter being divided into four disjoint subclasses.
| Class structure | Errors | Dataset | ||
|---|---|---|---|---|
| HAI | 11 | 128 | ||
| NoHAI | CAI | Suspected HAI | 4 | 5 |
| Not suspected HAI | 8 | 18 | ||
| NoINF | Suspected HAI | 7 | 9 | |
| Not suspected HAI | 9 | 53 | ||
| Total | 39 | 213 |
GTB: gradient tree boosting; HAI: hospital-acquired infection; CAI: community-acquired infection; NoINF: no infections at all.
Classifier errors (optimized GTB-Stem) for the different types of HAIs.
| Label | Errors | Dataset |
|---|---|---|
| Ventilator-associated pneumonia | 0 | 8 |
| Sepsis | 1 | 46 |
| Pneumonia | 1 | 33 |
| Other HAI | 1 | 15 |
| Fungus/virus | 1 | 15 |
| Central venous catheter–related HAI | 1 | 10 |
| Wound infection | 2 | 25 |
| Urinary tract infection | 4 | 20 |
|
| 2 | 10 |
GTB: gradient tree boosting.
A hospitalization marked with HAI may have one or more types of HAI. Hence, a misclassified HAI hospitalization may contribute to the number of errors for multiple labels.
Recall, specificity and precision for optimized GTB-Stem compared with the results found in the “Related work” section.
| Recall | Specificity | Precision | |
|---|---|---|---|
| GTB-Stem optimized | 93.7 | 64.1 | 79.7 |
| [7] ANN Internal | 96.64 | 85.96 | – |
| [7] LR external | 82.76 | 80.90 | – |
| [10] SVM | 92.6 | 43.73 | – |
| [11] SVM | 92.0 | 72.0 | – |
| [11] NB | 87 | 74.0 | – |
| [13] FLD S2 | 82.56 | – | 43.54 |
GTB: gradient tree boosting; ANN: artificial neural network; LR: linear regression; SVM: support vector machine; NB: Naïve Bayes classifiers; FLD: Fisher’s linear discriminant.
Note that the evaluation methods and datasets are not the same.