| Literature DB >> 35005154 |
Daniel Canton Enriquez1, Jose A Niembro-Ceceña1, Martin Muñoz Mandujano1, Daniel Alarcon1, Jorge Arcadia Guerrero2, Ivan Gonzalez Garcia1, Agueda Areli Montes Gutierrez2, Alfonso Gutierrez-Lopez3.
Abstract
Worldwide, COVID-19 coronavirus disease is spreading rapidly in a second and third wave of infections. In this context of increasing infections, it is critical to know the probability of a specific number of cases being reported. We collated data on new daily confirmed cases of COVID-19 breakouts in: Argentina, Brazil, China, Colombia, France, Germany, India, Indonesia, Iran, Italy, Mexico, Poland, Russia, Spain, U.K., and the United States, from the 20th of January, 2020 to 28th of August 2021. A selected sample of almost ten thousand data is used to validate the proposed models. Generalized Extreme-Value Distribution Type 1-Gumbel and Exponential (1, 2 parameters) models were introduced to analyze the probability of new daily confirmed cases. The data presented in this document for each country provide the daily probability of rate incidence. In addition, the frequencies of historical events expressed as a return period in days of the complete data set is provided.Entities:
Keywords: Coronavirus; Daily new cases statistical analysis; Exponential distribution; Gumbel distribution; Probabilistic analysis
Year: 2022 PMID: 35005154 PMCID: PMC8719919 DOI: 10.1016/j.dib.2021.107783
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Comparison between fit proposed models Exp-1P, Exp-2P and Gumbel with daily data on confirmed cases of COVID-19 in Germany.
Fig. 4Comparison between fit proposed models Exp-1P, Exp-2P and Gumbel with daily data on confirmed cases of COVID-19 in Mexico.
Fig. 5Daily new confirmed cases in Italy, probabilistic characterization with Exp-2P.
Fig. 7Daily new confirmed cases in Mexico, probabilistic characterization with Gumbel.
Parameters for fit proposed models Exponential (1, 2 parameters) and Gumbel.
| Gumbel | Exponential 2p | Exponential 1p | ||||
|---|---|---|---|---|---|---|
| Country | Total analyzed data | Scale parameter | Shape parameter | Scale parameter | Shape parameter | Location parameter |
| Argentina | 540 | 7448.0 | 4556.5 | 8284.5 | 1267.9 | 0.0001046 |
| Brazil | 546 | 300,052.0 | 13,869.0 | 25,216.4 | 13,326.7 | 0.0000259 |
| China | 581 | 126.8 | 460.6 | 837.5 | -674.9 | 0.0061500 |
| Colombia | 537 | 7106.9 | 4143.5 | 9114.9 | 1581.3 | 0.0001090 |
| France | 579 | 11,938.3 | 5691.2 | 15,305.6 | -2732.5 | 0.0000790 |
| Germany | 576 | 6768.8 | 2873.0 | 8771.6 | -1593.2 | 0.0001440 |
| India | 573 | 44,240.5 | 45,262.6 | 82,295.6 | -25,555.0 | 0.0000176 |
| Indonesia | 541 | 7890.7 | 2856.5 | 10,116.2 | -2707.4 | 0.0001350 |
| Iran | 553 | 6942.9 | 4595.5 | 8901.2 | -300.1 | 0.0001160 |
| Italy | 573 | 7049.8 | 3789.4 | 9041.7 | 1183.6 | 0.0001273 |
| Mexico | 544 | 2498.0 | 4529.4 | 5174.9 | 796.3 | 1867.00 |
| Poland | 539 | 6089.8 | 1841.6 | 7810.4 | 2454.2 | 0.0001867 |
| Russia | 572 | 6366.8 | 8022.4 | 8165.7 | 3531.2 | 0.0000855 |
| Spain | 571 | 9843.1 | 2891.5 | 12,619.4 | -4049.1 | 8570.26 |
| Turkey | 532 | 10,352.3 | 4252.0 | 13,277.3 | -3050.5 | 0.0000978 |
| U. Kingdom | 572 | 10,913.0 | 5271.3 | 13,991.1 | -2423.8 | 0.0000860 |
| United States | 580 | 50,807.4 | 36,334.4 | 65,137.7 | 508.7 | 65,646.35 |
| Subject | Data Mining and Statistical Analysis. Infectious Diseases |
| Specific subject area | Generalized Extreme-Value Distribution Type 1-Gumbel and Exponential (1, 2 parameters models applied to characterize probabilistically COVID-19 daily cases |
| Type of data | Table |
| How the data were acquired | The data on daily recent confirmed cases of COVID-19 were carefully collected from Dashboard by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University (JHU) Database. The data were built as a time-series database by Excel and probabilistic models for extreme values were satisfactorily established for analysis using Matlab. |
| Data format | Analyzed |
| Parameters for data collection | Under the framework of frequency analysis and the Moments estimation parameter method, a probabilistic fitting was carried out to the daily new confirmed Covid cases. Raw data from Argentina, Brazil, China, Colombia, France, Germany, India, Indonesia, Iran, Italy, Mexico, Poland, Russia, Spain, U.K., and the United States, were used. |
| Description of data collection | Daily data on new confirmed cases of COVID-19 outbreaks in Argentina, Brazil, China, Colombia, France, Germany, India, Indonesia, Iran, Italy, Mexico, Poland, Russia, Spain, U.K., and the United States from the 20th of January, 2020 to 28th of August 2021 are available in the Database. COVID-19 Dashboard by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University (JHU) ( |
| Data source location | Argentina, Brazil, China, Colombia, France, Germany, India, Indonesia, Iran, Italy, Mexico, Poland, Russia, Spain, U.K., and the United States. |
| Data accessibility | The analyzed data is publicly hosted in the mendeley repositories with the following data: |