| Literature DB >> 35771759 |
Mahsa Ashouri1, Frederick Kin Hing Phoa1.
Abstract
The COVID-19 data analysis is essential for policymakers to analyze the outbreak and manage the containment. Many approaches based on traditional time series clustering and forecasting methods, such as hierarchical clustering and exponential smoothing, have been proposed to cluster and forecast the COVID-19 data. However, most of these methods do not scale up with the high volume of cases. Moreover, the interactive nature of the application demands further critically complex yet compelling clustering and forecasting techniques. In this paper, we propose a web-based interactive tool to cluster and forecast the available data of Taiwan COVID-19 confirmed infection cases. We apply the Model-based (MOB) tree and domain-relevant attributes to cluster the dataset and display forecasting results using the Ordinary Least Square (OLS) method. In this OLS model, we apply a model produced by the MOB tree to forecast all series in each cluster. Our user-friendly parametric forecasting method is computationally cheap. A web app based on R's Shiny App makes it easier for practitioners to find clustering and forecasting results while choosing different parameters such as domain-relevant attributes. These results could help in determining the spread pattern and be utilized by medical researchers.Entities:
Mesh:
Year: 2022 PMID: 35771759 PMCID: PMC9246234 DOI: 10.1371/journal.pone.0265477
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Domain-relevant attribute categories used in Taiwan COVID-19 confirmed infection cases.
| Domain-relevant attributes | Categories |
|---|---|
| Region | north, east, west, south, null (imported cases) |
| Administrative | township/city, district, null (imported cases) |
| Population | numeric—no categories |
| Imported | yes, no (local cases) |
| Airport | yes, no (the city has an international airport or not) |
Taiwan COVID-19 interactive tool panel.
| Categories | Application |
|---|---|
| Choose file to upload | let users upload the Taiwan COVID-19 dataset |
| MOB depth (number of splits + 1) | changes from ‘no split’ to ‘full tree’, which controls the tree simplicity |
| Prune option | AIC or BIC |
| Splitting variables | include all available options for domain-relevant attributes (splitting variables). Options are ‘region’, ‘administrative’, ‘population’, ‘imported’, and ‘airport’ |
| Screenshot | let users screenshot the result |
Fig 1Clustering and forecasting Taiwan COVID-19 confirmed infection cases—Part 1.
Fig 6Clustering and forecasting Taiwan COVID-19 confirmed infection cases—Part 6.
Fig 2Clustering and forecasting Taiwan COVID-19 confirmed infection cases—Part 2.
Fig 3Clustering and forecasting Taiwan COVID-19 confirmed infection cases—Part 3.
Fig 4Clustering and forecasting Taiwan COVID-19 confirmed infection cases—Part 4.
Fig 5Clustering and forecasting Taiwan COVID-19 confirmed infection cases—Part 5.
Cluster categories of Taiwan COVID-19 confirmed infection cases by choosing three as the MOB depth, AIC as pruning option, and region, population, imported, administrative, and airport as domain-relevant attributes.
| Cluster categories | Number of series | |
|---|---|---|
| Cluster 1 | Population: more than 198795 | 26 |
| Cluster 2 | Population: less than 198795 | 103 |
| Cluster 3 | Population: less than 198795 | 54 |