| Literature DB >> 31032483 |
Gang Luo1.
Abstract
Predictive modeling based on machine learning with medical data has great potential to improve healthcare and reduce costs. However, two hurdles, among others, impede its widespread adoption in hdealthcare. First, medical data are by nature longitudinal. Pre-processing them, particularly for feature engineering, is labor intensive and often takes 50-80% of the model building effort. Predictive temporal features are the basis of building accurate models, but are difficult to identify. This is problematic. Healthcare systems have limited resources for model building, while inaccurate models produce sub-optimal outcomes and are often useless. Second, most machine learning models provide no explanation of their prediction results. However, offering such explanations is essential for a model to be used in usual clinical practice. To address these two hurdles, this paper outlines: 1) a data-driven method for semi-automatically extracting predictive and clinically meaningful temporal features from medical data for predictive modeling; and 2) a method of using these features to automatically explain machine learning prediction results and suggest tailored interventions. This provides a roadmap for future research.Entities:
Keywords: Automatic explanation; Machine learning; Medical data; Predictive modeling; Recurrent neural network; Temporal feature
Year: 2019 PMID: 31032483 PMCID: PMC6482973 DOI: 10.1016/j.glt.2018.11.001
Source DB: PubMed Journal: Glob Transit
Fig. 1.An LSTM network.
Fig. 2.A multi-component LSTM network with K components.
Fig. 3.A multi-component stacked LSTM network with K components and two recurrent layers.
Fig. 4.Identifying the effective segments of the input vector sequence in a top training instance.
Fig. 5.Time alignment of two sequences.
Fig. 6.Visualizing a cluster of three effective segments involving two longitudinal attributes.
Fig. 7.Displaying a sequence of values of the visit type attribute.
Fig. 8.Displaying the interval sequences from three patients’ hospitalization period attribute.