| Literature DB >> 27942425 |
Patricia Ordoñez1, Nelson Schwarz1, Adnel Figueroa-Jiménez2, Leonardo A Garcia-Lebron3, Abiel Roche-Lima4.
Abstract
The high frequency data in intensive care unit is flashed on a screen for a few seconds and never used again. However, this data can be used by machine learning and data mining techniques to predict patient outcomes. Learning finite-state transducers (FSTs) have been widely used in problems where sequences need to be manipulated and insertions, deletions and substitutions need to be modeled. In this paper, we learned the edit distance costs of a symbolic univariate time series representation through a stochastic finite-state transducer to predict patient outcomes in intensive care units. The Nearest-Neighbor method with these learned costs was used to classify the patient status within an hour after 10 h of data. Several experiments were developed to estimate the parameters that better fit the model regarding the prediction metrics. Our best results are compared with published works, where most of the metrics (i.e., Accuracy, Precision and F-measure) were improved.Entities:
Keywords: Classification and visualization of physiological data; Machine learning; Prediction of patient outcomes
Year: 2016 PMID: 27942425 PMCID: PMC5124435 DOI: 10.1007/s12553-016-0146-2
Source DB: PubMed Journal: Health Technol (Berl) ISSN: 2190-7196
Fig. 1Finite-state transducer with alphabet [a; b; c; λ], where λ represent the empty symbol. a deterministic - with pre-define costs b stochastic - costs as probabilities
Results of the validation metrics by changing the parameters of the data conversion when stochastic finite-state transducer model is used to determine the stochastic edit distance costs
| Exp. | Parameters | Results | ||
|---|---|---|---|---|
| # | Window Size | Symbols | Alphabet | |
| 1 | 120 | 6 | 3 | Accuracy =0.50 |
| Precision =0.0 | ||||
| Recall = NaN | ||||
| F-Measure = NaN | ||||
| 2 | 120 | 6 | 4 | Accuracy =0.5 |
| Precision =0.0 | ||||
| Recall = NaN | ||||
| F-Measure = NaN | ||||
| 3 | 120 | 6 | 5 | Accuracy =0.51 |
| Precision =0.23 | ||||
| Recall =0.5 | ||||
| F-Measure =0.31 | ||||
| 4 | 120 | 12 | 3 | Accuracy =0.76 |
| Precision =0.68 | ||||
| Recall =0.78 | ||||
| F-Measure =0.73 | ||||
| 5 | 120 | 12 | 4 | Accuracy =0.35 |
| Precision =0.28 | ||||
| Recall =0.25 | ||||
| F-Measure =0.26 | ||||
| 6 | 120 | 12 | 5 | Accuracy =0.68 |
| Precision =0.48 | ||||
| Recall =0.69 | ||||
| F-Measure =0.56 | ||||
| 7 | 120 | 24 | 3 | Accuracy =0.85 |
| Precision =0.82 | ||||
| Recall =0.87 | ||||
| F-Measure =0.85 | ||||
| 8 | 120 | 24 | 4 | Accuracy =0.69 |
| Precision =0.78 | ||||
| Recall =0.69 | ||||
| F-Measure =0.73 | ||||
| 9 | 120 | 24 | 5 | Accuracy =0.67 |
| Precision =0.81 | ||||
| Recall =0.57 | ||||
| F-Measure =0.67 | ||||
Metric Comparison: baseline vs current work
| Ordoñez et al. [ | Current Work | |
|---|---|---|
| Accuracy | 0.81 a | 0.85 |
| Precision | 0.79 b | 0.82 |
| Recall | 0.96 c | 0.87 |
| F-measure | 0.84 d | 0.85 |
aMost accurate value from Multivariate Piecewise Dynamic Time Warping
bMost precise value from Multivariate Stacked Bags of Patterns
cHighest recall value from Multivariate Piecewise Dynamic Time Warping
dHighest F-measure value from Multivariate Piecewise Dynamic Time Warping