Literature DB >> 35990010

ModInterv: An automated online software for modeling epidemics.

Arthur A Brum1, Gerson C Duarte-Filho2, Raydonal Ospina3, Francisco A G Almeida2, Antônio M S Macêdo1, Giovani L Vasconcelos4.   

Abstract

The COVID-19 pandemic has proven the importance of mathematical tools to understand the evolution of epidemic outbreaks and provide reliable information to the general public and health authorities. In this perspective, we have developed ModInterv, an online software that applies growth models to monitor the evolution of the COVID-19 epidemic in locations chosen by the user among countries worldwide or states and cities in the USA or Brazil. This paper describes the software capabilities and its use both in recent research works and by technical committees assisting government authorities. Possible applications to other epidemics are also briefly discussed.
© 2022 The Author(s).

Entities:  

Keywords:  COVID-19; Curve fitting; Epidemic curve; Growth models

Year:  2022        PMID: 35990010      PMCID: PMC9375249          DOI: 10.1016/j.simpa.2022.100409

Source DB:  PubMed          Journal:  Softw Impacts        ISSN: 2665-9638


Code metadata

Introduction

The ongoing COVID-19 pandemic is a public health emergency with severe impacts on the lives of people everywhere around the globe. Sadly, the disease has thus far claimed the lives of over 6 million people, a figure that ranks the COVID-19 pandemic among the deadliest in history. Public health authorities play a major role in the fight against COVID-19, making decisions concerning the use of masks, stay-at-home mandates, school closures, among other non-pharmacological interventions. Obtaining reliable information is paramount to guide decision makers in choosing the best course of action to slow down (and eventually stop) the spread of the disease. In this context, mathematical models and data processing softwares are indispensable tools in the fight against epidemics. Deterministic growth models are phenomenological approaches [1] governed by a single differential equation that often admit analytical solution. Their power lies in the fact that they are capable of capturing the basic features of epidemic dynamics, without the need to describe the underlying epidemiological mechanisms. One growth model that has proven quite effective in describing the complex features of COVID-19 epidemic curves is a generalized logistic model known as the beta logistic model (BLM) [2]. An extension of the BLM with time dependent parameters has also been shown to be able to efficiently model epidemics with multiple waves of infections [3]. In this paper, we present the main features of the software ModInterv which enables the user to monitor the dynamical stages of COVID-19 epidemic curves for both infection cases and deaths by implementing the multiwave BLM growth model mentioned above. More specifically, after the user chooses a type of data (cases or deaths) and a desired location (country or state/city in the USA or Brazil), the software automatically (i.e., without human assistance) performs a model fitting to the empirical data, extracts relevant information about the epidemic dynamics from the theoretical curve, and makes short-term predictions about the disease’s likely progression. The software is written in the Python programming language and made available in an HTML page currently accessed via the address http://fisica.ufpr.br/modinterv. The COVID-19 data used by ModInterv are fetched from public data repositories. Once the data is properly fetched, a plot of the empirical data is displayed and the user then has the option to perform a curve fitting, in which case the result of the best fit is shown superimposed with the empirical data. The software also allows the user to perform additional analysis on the basis of the fitted model. We note in passing that there are several other softwares developed for application to the COVID-19 pandemic in the most diverse forms. One very common type of software is the dashboard, which is a visual display of a given dataset with the possibility to change between charts, graphs or the data themselves, as is the case of the World Health Organization Coronavirus Dashboard [4] and the Johns Hopkins University Dashboard [5]. Other softwares center their attention to forecasting the evolution of the pandemic, such as the Forecast Hub [6] and the Covidsim from Imperial College London [7], while others focus on the diagnosis of COVID-19 [8], [9] and the numerical fitting of multi-wave epidemic models [10]. The app ModInterv described here differs substantially from the aforementioned softwares in that it is a web-based application that enables the user to monitor the evolution of the COVID-19 pandemic in countries around the world as well as for states and cities in Brazil and the US, through the use of mathematical growth models, from which it can determine the current epidemic stage in the chosen location. The app can also make short-term predictions on the evolution of the epidemic curve for a given location based on the numerical fit of the growth model.

Software

ModInterv is a software that runs within IPython Notebook [11], an interactive computational environment that combines code execution, rich text, and plots. The code is divided into two similar major cells: one performs analyses of COVID-19 data for countries, while the second does the same for states and cities in the USA and Brazil. In each of these cells there are functions that perform specific tasks, such as fetching the data for the chosen location, plotting the empirical data, performing curve fits, and exporting the results onto figure files with different formats. To ensure its functionality, the code was tested and debugged before it was made available online. The software workflow and its dependencies with the main Python modules used are shown in Fig. 1. IPywidgets (also known as Jupyter Widgets) [12] is a framework that provides interactive controls for Jupyter Notebooks through the use of GUI elements, the so-called widgets, which are tools displayed as part of the output of the code that allow the user to provide inputs to the software. The main widgets used by the ModInterv software are as follows: dropdown menus, to allow the user to select the type of epidemic data (cases or deaths) to be analyzed and the desired location (countries or states and cities in the USA and Brazil); checkboxes, to select which outputs of the fit the user wants to see; and sliders, to select the final date used in the fits and the time range for a short term prediction of the epidemic curve. Once the user chooses a place and type of data, the respective empirical data are fetched from a specific data repository and imported into the software as a Pandas DataFrame [13]. Empirical data for countries as well for states and counties in the USA are fetched from the database by the Johns Hopkins University [14], whereas data for Brazilian states and cities are fetched from the GitHub database maintained by Wesley Cota [15]. After the data has been imported, the corresponding time series is then transformed into a NumPy array [16] and plotted using the Matplotlib [17], a standard graphics module in Python. If the user chooses to perform the curve fit (by clicking on the appropriate button), the non-linear least square minimization problem is dealt with via the Levenberg–Marquardt algorithm implemented in Python by the LMFIT package [18]. From the best-fit curve, the software then determines the epidemic’s previous and current stages in the chosen location by computing certain characteristic dynamical points (zeros of the second to the fourth derivatives) [19] using the SciPy package [20]. While the software’s front-end is built using the IPython and IPywidgets frameworks, its back-end is provided by a binder [21] server, which builds a Docker image from an IPython notebook into an interactive executable that can be converted to an HTML file, thus rendering the software accessible for online use.
Fig. 1

Schematic representation of the ModInterv software and its dependencies with the main Python’s libraries and modules.

Schematic representation of the ModInterv software and its dependencies with the main Python’s libraries and modules.

Impact overview

The ModInterv was developed by the MODINTERV-COVID19 Research Network, a research group formed by the present authors at the beginning of the COVID-19 pandemics. The software is licensed under a MIT license. The main purpose of the software is to offer an online tool for monitoring the epidemic evolution in countries around the world as well as in states and cities in Brazil and the USA, by means of reliable mathematical analyses (rather than mere visual inspection of epidemic curves). The software implements in a automated fashion phenomenological growth models to describe the empirical data and extract relevant information about the dynamics of the epidemic in the chosen location, thus contributing to a more precise assessment of the current epidemic situation in that location and its future evolution. Since its inception, the software has been employed by members of our research network to study the COVID-19 dynamics in countries worldwide [2], [3] as well as in states and cities in Brazil [22], [23]. The software has also been used as a complementary tool of analysis by the Scientific Committee to Combat Coronavirus (C4NE in its Portuguese acronym) of the Consórcio Nordeste, a Brazilian interstate consortium formed by the Northeastern state governments [24] with the purpose of promoting sustainable development in the region and enabling the implementation of joint public policies. Over the past two years, the C4NE has published several technical bulletins about the COVID-19 situation in the Brazilian Northeastern [25] in which the ModInterv software was used (together with other methods) to analyze the epidemic evolution in the region. Extensive use of the ModInterv was made by the COVID-19 data analysis team of the Prefecture of Teresina, capital of the Brazilian state Piauí, to monitor the epidemic there. Perhaps more importantly, they have taken the model’s predictions into account in their decision making process (e.g., as to when to relax control measures), particularly during the first wave of the epidemic [26].

Software limitations and future improvements

Notwithstanding its proven successes, as mentioned in the preceding section, the ModInterv software can be further improved to remove some of its minor limitations. The main points that can be improved concern the wave detection algorithm and the numerical fitting procedure, as briefly discussed below. Regarding wave detection, the main difficulty lies in finding which of the (many) local maxima and minima detected by the Python’s native signal processing module are actually local maxima and minima of the epidemic curve, rather than random fluctuations. In the current version of ModInterv we have implemented a filter based on the relative amplitudes of nearby maxima/minima to reject those that are likely due to fluctuations [19]. We are currently investigating the possibility of using a more selective algorithm recently introduced in the literature for detecting relevant maxima and minima [27]. Another difficulty stems from the often large number of parameters that need to be determined from the chosen empirical data (there are free parameters for an epidemic curve with waves). To avoid over-fitting, we have implemented a multi-step procedure in which we begin by fitting the first wave and then successively include data for each subsequent wave, whereby each next partial fit uses the fitted parameters from the previous step as initial guesses [19], until the entire dataset is included. This multi-step routine may, on occasion, render the numerical fit rather time consuming, especially for curves with four or more waves. It remains an open question to find more efficient optimization algorithms to speed up the best-fit routine without incurring in excessive over-fitting. The software impact can be further augmented by extending its applications to other epidemics, such as monkeypox, measles, dengue, zika, etc. Indeed, the software behind the ModInterv app can be readily applied to any infectious disease outbreak, so long as the data are available in public repositories.

Conclusions

ModInterv is an online software written in Python which enables the user to monitor COVID-19 epidemic curves around the world. The software fits empirical epidemiological data with a multiwave generalized logistic model in a fully automated manner (i.e., without human assistance), thus making a sophisticated mathematical analyses accessible to any user. In this sense, the ModInterv is a unique informatics tool for monitoring the epidemic evolution and as such it should be of interest to researchers and health authorities as well as to the general public. The ModInterv has been employed by our COVID-19 Research Network in several published research papers (http://fisica.ufpr.br/redecovid19). It has also been used by technical committees (assisting government authorities) as an auxiliary tool to analyze the evolution of the epidemic in Northeastern Brazil.

CRediT authorship contribution statement

Arthur A. Brum: Conceptualization, Methodology, Software, Validation, Investigation, Data curation, Writing – original draft, Writing – review and editing, Supervision. Gerson C. Duarte-Filho: Conceptualization, Methodology, Software, Validation, Investigation, Data curation, Writing – original draft, Writing – review and editing. Raydonal Ospina: Conceptualization, Methodology, Software, Validation, Investigation, Data curation, Writing – original draft, Writing – review and editing, Supervision. Francisco A.G. Almeida: Conceptualization, Methodology, Validation, Investigation, Writing – review and editing. Antônio M.S. Macêdo: Conceptualization, Methodology, Validation, Investigation, Writing – review and editing. Giovani L. Vasconcelos: Conceptualization, Methodology, Software, Validation, Investigation, Data curation, riting – original draft, Writing – review and editing, Supervision.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Current code versionv1.0
Permanent link to code/repository used for this code versionhttps://github.com/SoftwareImpacts/SIMPAC-2022-142
Permanent link to Reproducible Capsulehttps://codeocean.com/capsule/1964502/tree/v1
Legal Code LicenseMIT License
Code versioning system usednone
Software code languages, tools, and services usedPython
Compilation requirements, operating environments & dependencies
If available Link to developer documentation/manual
Support email for questionsarthur.brum@ufpe.br
  10 in total

Review 1.  Mathematical models to characterize early epidemic growth: A review.

Authors:  Gerardo Chowell; Lisa Sattenspiel; Shweta Bansal; Cécile Viboud
Journal:  Phys Life Rev       Date:  2016-07-11       Impact factor: 11.025

Review 2.  Array programming with NumPy.

Authors:  Charles R Harris; K Jarrod Millman; Stéfan J van der Walt; Ralf Gommers; Pauli Virtanen; David Cournapeau; Eric Wieser; Julian Taylor; Sebastian Berg; Nathaniel J Smith; Robert Kern; Matti Picus; Stephan Hoyer; Marten H van Kerkwijk; Matthew Brett; Allan Haldane; Jaime Fernández Del Río; Mark Wiebe; Pearu Peterson; Pierre Gérard-Marchant; Kevin Sheppard; Tyler Reddy; Warren Weckesser; Hameer Abbasi; Christoph Gohlke; Travis E Oliphant
Journal:  Nature       Date:  2020-09-16       Impact factor: 49.962

3.  Power law behaviour in the saturation regime of fatality curves of the COVID-19 pandemic.

Authors:  Giovani L Vasconcelos; Antônio M S Macêdo; Gerson C Duarte-Filho; Arthur A Brum; Raydonal Ospina; Francisco A G Almeida
Journal:  Sci Rep       Date:  2021-02-25       Impact factor: 4.379

4.  COV-ADSX: An Automated Detection System using X-ray Images, Deep Learning, and XGBoost for COVID-19.

Authors:  Sharif Hasani; Hamid Nasiri
Journal:  Softw Impacts       Date:  2021-12-29

5.  Source Code for Optimized Parallel Inception: A Fast COVID-19 Screening Software.

Authors:  Alireza Tavakolian; Farshid Hajati; Alireza Rezaee; Amirhossein Oliaei Fasakhodi; Shahadat Uddin
Journal:  Softw Impacts       Date:  2022-06-22

6.  The United States COVID-19 Forecast Hub dataset.

Authors:  Estee Y Cramer; Yuxin Huang; Yijin Wang; Evan L Ray; Matthew Cornell; Johannes Bracher; Andrea Brennen; Alvaro J Castro Rivadeneira; Aaron Gerding; Katie House; Dasuni Jayawardena; Abdul Hannan Kanji; Ayush Khandelwal; Khoa Le; Vidhi Mody; Vrushti Mody; Jarad Niemi; Ariane Stark; Apurv Shah; Nutcha Wattanchit; Martha W Zorn; Nicholas G Reich
Journal:  Sci Data       Date:  2022-08-01       Impact factor: 8.501

7.  ModInterv: An automated online software for modeling epidemics.

Authors:  Arthur A Brum; Gerson C Duarte-Filho; Raydonal Ospina; Francisco A G Almeida; Antônio M S Macêdo; Giovani L Vasconcelos
Journal:  Softw Impacts       Date:  2022-08-13

8.  epidWaves: A code for fitting multi-wave epidemic models.

Authors:  Americo Cunha; Fernando da Conceição Batista; Paulo Roberto de Lima Gianfelice; Ricardo Sovek Oyarzabal; Jose Mario Vicensi Grzybowski; Elbert E N Macau
Journal:  Softw Impacts       Date:  2022-07-26

9.  An interactive web-based dashboard to track COVID-19 in real time.

Authors:  Ensheng Dong; Hongru Du; Lauren Gardner
Journal:  Lancet Infect Dis       Date:  2020-02-19       Impact factor: 25.071

Review 10.  SciPy 1.0: fundamental algorithms for scientific computing in Python.

Authors:  Pauli Virtanen; Ralf Gommers; Travis E Oliphant; Matt Haberland; Tyler Reddy; David Cournapeau; Evgeni Burovski; Pearu Peterson; Warren Weckesser; Jonathan Bright; Stéfan J van der Walt; Matthew Brett; Joshua Wilson; K Jarrod Millman; Nikolay Mayorov; Andrew R J Nelson; Eric Jones; Robert Kern; Eric Larson; C J Carey; İlhan Polat; Yu Feng; Eric W Moore; Jake VanderPlas; Denis Laxalde; Josef Perktold; Robert Cimrman; Ian Henriksen; E A Quintero; Charles R Harris; Anne M Archibald; Antônio H Ribeiro; Fabian Pedregosa; Paul van Mulbregt
Journal:  Nat Methods       Date:  2020-02-03       Impact factor: 28.547

  10 in total
  1 in total

1.  ModInterv: An automated online software for modeling epidemics.

Authors:  Arthur A Brum; Gerson C Duarte-Filho; Raydonal Ospina; Francisco A G Almeida; Antônio M S Macêdo; Giovani L Vasconcelos
Journal:  Softw Impacts       Date:  2022-08-13
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.