| Literature DB >> 34737093 |
Haowen Xu1, Andy Berres2, Gautam Thakur3, Jibonananda Sanyal4, Supriya Chinthavali5.
Abstract
We present EPIsembleVis, a web-based comparative visual analysis tool for evaluating the consistency of multiple COVID-19 prediction models. Our approach analyzes a collection of COVID-19 predictions from different epidemiological models as an ensemble and utilizes two metrics to quantify model performance. These metrics include (a) prediction uncertainty (represented as the dispersion of predictions in each ensemble) and (b) prediction error (calculated by comparing individual model predictions with the recorded data). Through an interactive visual interface, our approach provides a data-driven workflow for (a) selecting and constructing the COVID-19 model prediction ensemble based on the spatiotemporal overlap of available predictions of multiple epidemiological models, (b) quantifying the model performance using both the uncertainty of each model prediction ensemble, and the error of each ensemble member that represents individual model predictions, and (c) visualizing the spatiotemporal variability in the projection performance of individual models using a suite of novel ensemble visualization techniques, such as the data availability map, a spatiotemporal textured-tile calendar, multivariate rose chart, and time-series leaflet glyph. We demonstrate the capability of our ensemble visual interface through a case study that investigates the performance of weekly COVID-19 predictions, which are provided through the COVID-19 Forecast Hub UMass-Amherst Influenza Forecasting Center of Excellence [47] for the United States and United States Territories. The EPIsembleVis tool is implemented using open-source web technologies and adaptive system design, rendering it interoperable with Elasticsearch and Kibana for automatically ingesting COVID-19 predictions from online repositories, and it is generalizable for analyzing worldwide projections from more epidemiological models.Entities:
Keywords: COVID-19; COVID-19 data ontology; Ensemble visualization; Epidemiological models; Geographic visualization; Health geography; Multivariate; Spatiotemporal; Web-based
Mesh:
Year: 2021 PMID: 34737093 PMCID: PMC8559418 DOI: 10.1016/j.jbi.2021.103941
Source DB: PubMed Journal: J Biomed Inform ISSN: 1532-0464 Impact factor: 6.317
Fig. 1The overview and general user Workflow of the visual interface.
Fig. 2Spatiotemporal tiles in a calendar view for daily predictions (a), a weekly view for weekly predictions (b), and a detailed view of a single tile (c).
Fig. 5Through the leaflet glyph, we are able to overview and compare the time-series of the prediction error from 4 models across 6 states. The prediction error is estimated by comparing the model prediction with the NYT recorded data.
Fig. 3Through the visual interface, we compare the local (a) and global (b) views of leaflet glyphs for visualizing the accuracy of different models across the states.
Fig. 4Through the overlay of the NASA night-light map, the rose chart reveals that the overall disagreement between models is less for the 4-week-into-future projection. Prediction ensembles in populated areas usually have a lower coefficient of variation (indicated by light yellow colors and well-balanced petals) compared with the ensembles in sparsely populated areas. Examples of these sparsely populated areas include (a) Montana, Idaho, and Wyoming (highlighted by the green dashed ellipsoid), and (b) West Virginia (circled in blue) in the Eastern area (highlighted with the blue dashed square). Ensemble predictions in these states present high dispersion as indicated by their rose charts through the dark red color-coding and unbalanced petal sizes. Close-ups of the rose charts for Pennsylvania (orange) and West Virginia (blue) are shown on the right-hand side.