Literature DB >> 32501376

Analysis of Spatial Spread Relationships of Coronavirus (COVID-19) Pandemic in the World using Self Organizing Maps.

Patricia Melin1, Julio Cesar Monica1, Daniela Sanchez1, Oscar Castillo1.   

Abstract

We describe in this paper an analysis of the spatial evolution of coronavirus pandemic around the world by using a particular type of unsupervised neural network, which is called self-organizing maps. Based on the clustering abilities of self-organizing maps we are able to spatially group together countries that are similar according to their coronavirus cases, in this way being able to analyze which countries are behaving similarly and thus can benefit by using similar strategies in dealing with the spread of the virus. Publicly available datasets of coronavirus cases around the globe from the last months have been used in the analysis. Interesting conclusions have been obtained, that could be helpful in deciding the best strategies in dealing with this virus. Most of the previous papers dealing with data of the Coronavirus have viewed the problem on temporal aspect, which is also important, but this is mainly concerned with the forecast of the numeric information. However, we believe that the spatial aspect is also important, so in this view the main contribution of this paper is the use of unsupervised self-organizing maps for grouping together similar countries in their fight against the Coronavirus pandemic, and thus proposing that strategies for similar countries could be established accordingly.
© 2020 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Coronavirus; Neural Networks; Self-Organizing Maps; Spatial Similarity

Year:  2020        PMID: 32501376      PMCID: PMC7241408          DOI: 10.1016/j.chaos.2020.109917

Source DB:  PubMed          Journal:  Chaos Solitons Fractals        ISSN: 0960-0779            Impact factor:   5.944


Introduction

Recently we have witnessed the rapid spread of the Coronavirus around the globe, beginning originally in China and then spreading to Korea and Japan, and after that to Europe and America. In particular, in the case of Europe, Italy and Spain have been hit very hard with the spread of the virus, having many confirmed cases and deaths. After that, in the American continent, the United States has also been hit very hard with the spread of the virus. So it is very critical understanding all the facets of this problem, for being able to cope with its complexity and at the same limit its negative impact on the health of the population around the world and also the economic implications for the countries. Due to the importance of finding ways to control the propagation of the virus, many papers have been put forward on these last months related to different aspects of this problem, and in particular several authors have attempted to apply computational intelligence techniques in this area. As a sample of these works we can mention the ones below. The coronavirus disease (COVID-19) is a viral infection highly transmittable caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), which originally appeared in Wuhan, China, and it has sequentially propagated around the world. The intermediate source of origin and transfer to humans is not known, but the quick human to human transfer has been confirmed in many experiments. Nowadays there is not yet a clinically approved antiviral drug or vaccine that can be used against COVID-19. Recently at the end of 2019, the city of Wuhan, China, the epicenter of the current COVID-19 experienced an outbreak of a novel coronavirus that killed more than eighteen hundred and infected thousands of individuals within the first two months of the epidemic [9]. More recently, the epicenter has moved to other cities in Europe and then in America. The patients’ most notable found symptoms (according to the collected experimental data) are dry cough, dyspnea, fiver and bilateral lung infiltrates on imaging. Initially all the cases were associated to Wuhan's Huanan Seafood Wholesale Market, which trades in seafood and a wide variety of live animal species. Due to the many reported cases up to January 30th 2020, the World Health Organization (WHO) declared the Chinese outbreak of COVID-19 to be a Public Health Emergency of International Concern posing a high risk to countries with vulnerable health systems around the world [10]. There have recently been several studies with the goal of understanding the patterns of COVID-19, and one of this is: using a dataset of X-ray medical images from patients with common bacteria pneumonia confirmed with COVID-19 disease to identify possible patterns that may lead to the automatic diagnosis disease using convolutional neural networks, and the results demonstrate that the used method has significant effects on the automatic detection and diagnosis of COVID-19 [11]. Another interesting study is the investigation of the cases of COVID-19 in China using dynamic statistical techniques [12]. Other cases are: predicting commercially available antiviral drugs that may act on the novel coronavirus using a deep learning model [13] and early prediction of the 2019 novel coronavirus outbreak in mainland China based on simple mathematical model [14]. Also, the paper in [15] offers pointers to, and describes, a range of practical online/mobile GIS and mapping dashboards and applications for tracking the 2019/2020 coronavirus epidemic and associated events as they unfold around the world. In addition, in [16] the authors proposed applying the concept of cartograms to visualize both the expansion and spread of COVID-19. Finally, we have to mention that some research has been done using Artificial Intelligence (AI), for example the study in [17] in which the authors proposed the use of machine learning algorithms for improving possible case identifications of COVID-19 more quickly when using a mobile phone-based web survey. Also several AI techniques are applied in analyzing data and decision-making processes in healthcare. This means that AI-driven tools can help in identifying COVID-19 outbreaks, as well as forecast their nature of spread rate across the world [18]. However, most of the previous works deal with the temporal aspect of the problem, which means that these works are attempting to predict or forecast in different ways the coronavirus numeric data. Of course, this facet of the problem is also important, as governments want to be able to know the estimated future values of the coronavirus cases to make the right decisions regarding funds to be assigned to solving the problem. On the other hand, it is our firm believe that the spatial aspect is also very important, so in this regard the main contribution of this paper is the use of unsupervised self-organizing Kohonen maps for grouping together similar countries in their fight against the Coronavirus pandemic, and thus in this way be able to propose that strategies for similar countries could be established accordingly. In our opinion, this contribution is very important as it could complement the temporal perspective that has been developed by most of the previous papers by providing the spatial component to achieve a complete solution to the Coronavirus problem. The remaining contents of the paper are structured in the following form. Section 2 outlines the fundamental concepts of self-organizing maps, which are a particular form of unsupervised neural networks. Section 3 describes the problem at hand and the proposed methodology in this work. Section 4 summarizes the simulation results achieved with the proposed approach. Finally, Section 5 offers the conclusions and possible future works.

Self-organizing maps

The Self-organizing maps (SOM), also called the Kohonen map, is a model being used to explore and visualize patterns in high-dimensional datasets. This model was first introduced by Teuvo Kalevi Kohonen in 1982. SOM is a clustering technique that identifies groups in a dataset without having to use traditional statistical techniques. The SOM consists of only two layers: the input layer and the output layer [1]. The goal of this neural network is to transfer all input data objects with n attributes (n dimensions) to the output in a way that the objects are related to each other. The SOM is based on an unsupervised training where there is no given output target, the objective of the algorithm is to find the set of centroids (neurons) to represent the cluster, but with topological restrictions. Topology refers to a centroid arrangement on the output grid, the most common used topology grids are the hexagonal and rectangular. Each of all data objects in the dataset is assigned to each centroid. Each neuron in the SOM grid is closely related to each other and each of the inputs are connected to each of the output nodes by means of a connection weight. Weights from N input nodes to M output nodes are initialized in small values randomly [2]. The activation of the output units according to Kohonen's is shown in the Eq. 1. The modification of the weights is shown in Eq. 2: where activation of output unit j, activation value from input unit, lateral weights connecting to output unit, neurons in neighborhood, unity function returning 1 or 0, gain term decreasing over time. The lateral connections enable the SOM to learn “competitively”, meaning that the output neurons in the output layers compete for the classification of the input patterns. At the beginning of the training, the input patterns are presented to the SOM and the output object with the nearest weight vector will be the winner to represent that cluster. Equation 1 shows how the Euclidean distance is used to select the winning neuron [3]. In Figure 1 , the SOM neural network structure is illustrated with its neighborhoods around the winning neuron.
Fig. 1

Example of the SOM neural network general architecture.

Example of the SOM neural network general architecture. Artificial neural networks, such as the SOM have widely been used in many applications, such as for identification of groundwater salinity sources [4], Determination of plant communities based on bryophytes [5], Prediction of arthritis [6]. However, here the SOM is applied to classify 199 countries of the world and the 32 states of Mexico with confirmed cases of the COVID-19 to identify if there is a pattern within the: too high, high, medium and low clusters being used. The world dataset was obtained from the Humanitarian Data Exchange (HDX) [7], and the Mexican dataset from the Mexico's Government website [8].

Proposed Method

The Data base used for the experiments was obtained from the Humanitarian Data Exchange (HDX) [7], which includes data from the countries where COVID-19 cases have occurred from January 22, 2020 to May 13, 2020. The consulted datasets were the following: time_series_covid19_confirmed_global, time_series_covid19_recovered_global, and time_series_covid19_deaths_global. The data includes the confirmed, recovered and deaths cases for countries, respectively. In Figure 2 a sample of a SOM neural network used for clustering and is classification for the countries is shown.
Fig. 2

An example SOM neural network used for clustering and classification of countries.

An example SOM neural network used for clustering and classification of countries. Also the data set of the 32 states of Mexico was used for illustrating the clustering depending in the similarity patterns the data base was obtained from the Mexican dataset in the Mexico's Government website [8]. Figure 3 shows the structure of the SOM used for clustering the 32 states of Mexico.
Fig. 3

Structure of SOM neural network used for clustering the 32 states of Mexico.

Structure of SOM neural network used for clustering the 32 states of Mexico. In the case of the 32 states of Mexico, two of the most prevalent diseases in the population were also studied, which are hypertension and diabetes. This is in order to find similarities and form groupings by states between the diseases and Covid-19. The database of these diseases was obtained from the open data web page of the Mexican Institute of Social Security (IMSS) [9].

Simulation Results

The proposed method based on the Kohonen self-organizing maps was used to form groupings or clusters of countries in the world, and after that their classification was done by considering 4 classes according to the severity of the number of Coronavirus cases: Very High, High, Medium and Low (indicated by red, orange, yellow and green, respectively, in the maps). In Table 1 , countries are ordered by number of cases occurring in the clusters, and then alphabetically inside the clusters. In the following Figures we show the obtained results with the proposed method using the publicly available data sets of confirmed, recovered and death cases.
Table 1

The results of confirmed cases of Covid-19 around the world (up to May 13, 2020).

ClusterCountryValue
Very HighUnited States (US)1390361
HighBrazil189157
France178184
Germany174098
Italy222104
Russia242271
Spain228691
Turkey143114
United Kingdom230986
MediumBelgium53981
Canada73568
Chile34381
China84024
India78055
Iran112725
Mexico40186
Netherlands43410
Pakistan35298
Peru76306
Saudi Arabia44830
LowAfghanistan5226
Albania880
Algeria6253
The results of confirmed cases of Covid-19 around the world (up to May 13, 2020). In Figure 4 we show a plot of the clusters formed with the SOM method, clearly indicating the classes for the Covid-19 Confirmed cases for the 22-01-2020 to 13-05-2020 period of time.
Fig. 4

Classification of countries according to confirmed Coronavirus cases.

Classification of countries according to confirmed Coronavirus cases. In Figure 5 we show a plot of the clusters formed with the SOM method, clearly indicating the classes for the Covid-19 recovered cases for the January 22 of 2020 to May 13 of 2020 period of time.
Fig. 5

Classification of countries according to recovered Coronavirus cases.

Classification of countries according to recovered Coronavirus cases. In addition, the same analysis can be done for the spatial distribution of deaths due to Coronavirus around the globe. In Figure 6 we show a plot of the clusters formed with the SOM method, clearly indicating the classes for the Covid-19 death cases for the January 22 of 2020 to May 13 of 2020 period of time.
Fig. 6

Classification of countries according to death related Coronavirus cases.

Classification of countries according to death related Coronavirus cases. We were also interested in taking down this spatial analysis to the country level, and for this we applied it to the country of Mexico. In this case, we have to consider 32 states in Mexico and the SOM method clusters states according to their similarities to other states, finding in this way a colored map similar to the world map. In Figure 7 we can find the clustering of states in Mexico according to the confirmed Coronavirus cases during the period of time from February 27 of 2020 to May 13 of 2020. In Table 2 , states of Mexico are ordered by number of cases in the clusters, and then alphabetically inside the clusters.
Fig. 7

Classification of states in Mexico according to confirmed Coronavirus cases.

Table 2

The results of confirmed cases of Covid-19 in the states of Mexico (up to May 13, 2020).

ClusterStateValue
Very HighCiudad de México10946
Estado de México6813
HighBaja California2764
Sinaloa1620
Tabasco1976
Veracruz1574
MediumChihuahua768
Coahuila616
Guanajuato580
Guerrero670
Hidalgo637
Jalisco699
Michoacán678
Morelos915
Nuevo León717
Puebla1213
Quintana Roo1177
Sonora642
Tamaulipas799
Yucatán924
LowAguascalientes398
Baja California Sur409
Campeche226
Chiapas450
Colima46
Durango127
Nayarit252
Oaxaca291
Querétaro315
San Luis Potosí338
Tlaxcala438
Classification of states in Mexico according to confirmed Coronavirus cases. The results of confirmed cases of Covid-19 in the states of Mexico (up to May 13, 2020). In addition, the same analysis can be done for the spatial distribution of deaths due to Coronavirus in the states of Mexico. In Figure 8 we show a plot of the clusters formed with the SOM method, clearly indicating the classes for the Covid-19 death cases for the February 27 of 2020 to May 13 of 2020 period of time
Fig. 8

Classification of states in Mexico according to death Coronavirus cases.

Classification of states in Mexico according to death Coronavirus cases. In this case, we were also interested in the possible relation of propensity of Coronavirus deaths to the chronic degenerative Hypertension and Diabetes diseases. Based on this, we also applied SOM clustering to the publicly available data in Mexico of these cases [18], [19] In Figure 9 we can find the results of clustering the states of Mexico according to the number of Hypertension cases from 2000 to 2018.
Fig. 9

Classification of states in Mexico according to the number of Hypertension cases.

Classification of states in Mexico according to the number of Hypertension cases. If we compare Figures 8 and 9 we can find that there is a similarity between states with higher number of deaths to the states with higher number of Hypertension cases, confirming a relation between these variables. In addition, in Figure 10 we can find the results of clustering the states of Mexico according to the number of Diabetes cases from 2000 to 2018.
Fig. 10

Classification of states in Mexico according to the number of Diabetes cases.

Classification of states in Mexico according to the number of Diabetes cases. Once again, if we compare Figures 8 and 10 we can find that there is a similarity between states with higher number of deaths to the states with higher number of Diabetes cases, confirming a relation between these variables. In this regard, we believe a model could be constructed using the number of cases of hypertension and diabetes to estimate the number of Coronavirus cases, that could reflect the interaction among these variables.

Conclusions

In this paper an analysis of the spatial evolution of coronavirus pandemic around the world by using a particular type of unsupervised neural network was presented. Based on the clustering abilities of self-organizing maps we were able to spatially group together countries that are similar according to their coronavirus cases, in this way being able to analyze which countries are behaving similarly and thus can benefit by using similar strategies in dealing with the spread of the virus. Publicly available datasets of coronavirus cases around the globe from the last months were used in the analysis. Interesting conclusions have been obtained, that could be helpful in deciding the best strategies in dealing with this virus. In addition, the proposed approach was tested with the spatial distribution of cases around the country of Mexico and its relation to the Diabetes and Hypertension cases. Most of the previous papers dealing with data of the Coronavirus have viewed the problem on its temporal aspect, which is also important, but this is mainly concerned with the forecast of the numeric information. However, we believe that the spatial aspect is also important, so in this view the main contribution of this paper is the use of unsupervised self-organizing maps for grouping together similar countries in their fight against the Coronavirus pandemic, and thus proposing that strategies for similar countries could be established accordingly. As future work, we envision integrating both the spatial and temporal aspects of the Coronavirus spread problem in a unified manner to achieve a complete view and solution to the problem. We can also consider applying other intelligent techniques (like fuzzy logic, evolutionary algorithms and swarm intelligence) that could help in dealing in a better way with this complex problem. Finally, we could also consider other recent approaches, as the ones presented in [20, 21], and other recent interesting works related to evolving fuzzy models and chaos, like in [22], [23], [24], [25], [26]. In summary, we envision that there are many potential beneficial lines of research that could be engaged.

CRediT authorship contribution statement

Patricia Melin: Methodology, Data curation, Writing - review & editing. Julio Cesar Monica: Formal analysis, Methodology, Writing - review & editing. Daniela Sanchez: Validation, Writing - review & editing. Oscar Castillo: Formal analysis, Data curation, Writing - review & editing.

Declaration of competing interest

The authors of the above manuscript whose names are listed above certify that they have NO affiliations with or involvement in any organization or entity with any financial interest (such as honoraria; educational grants; participation in speakers’ bureaus; membership, employment, consultancies, stock ownership, or other equity interest; and expert testimony or patent-licensing arrangements), or non-financial interest (such as personal or professional relationships, affiliations, knowledge or beliefs) in the subject matter or materials discussed in this manuscript.
  10 in total

1.  Early Prediction of the 2019 Novel Coronavirus Outbreak in the Mainland China Based on Simple Mathematical Model.

Authors:  Linhao Zhong; Lin Mu; Jing Li; Jiaying Wang; Zhe Yin; Darong Liu
Journal:  IEEE Access       Date:  2020-03-09       Impact factor: 3.367

2.  Identification of COVID-19 can be quicker through artificial intelligence framework using a mobile phone-based survey when cities and towns are under quarantine.

Authors:  Arni S R Srinivasa Rao; Jose A Vazquez
Journal:  Infect Control Hosp Epidemiol       Date:  2020-03-03       Impact factor: 3.254

Review 3.  COVID-19 infection: Origin, transmission, and characteristics of human coronaviruses.

Authors:  Muhammad Adnan Shereen; Suliman Khan; Abeer Kazmi; Nadia Bashir; Rabeea Siddique
Journal:  J Adv Res       Date:  2020-03-16       Impact factor: 10.479

4.  Geographical tracking and mapping of coronavirus disease COVID-19/severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) epidemic and associated events around the world: how 21st century GIS technologies are supporting the global fight against outbreaks and epidemics.

Authors:  Maged N Kamel Boulos; Estella M Geraghty
Journal:  Int J Health Geogr       Date:  2020-03-11       Impact factor: 3.918

5.  Predicting commercially available antiviral drugs that may act on the novel coronavirus (SARS-CoV-2) through a drug-target interaction deep learning model.

Authors:  Bo Ram Beck; Bonggun Shin; Yoonjung Choi; Sungsoo Park; Keunsoo Kang
Journal:  Comput Struct Biotechnol J       Date:  2020-03-30       Impact factor: 7.271

6.  AI-Driven Tools for Coronavirus Outbreak: Need of Active Learning and Cross-Population Train/Test Models on Multitudinal/Multimodal Data.

Authors:  K C Santosh
Journal:  J Med Syst       Date:  2020-03-18       Impact factor: 4.460

7.  Investigating the cases of novel coronavirus disease (COVID-19) in China using dynamic statistical techniques.

Authors:  Samuel Asumadu Sarkodie; Phebe Asantewaa Owusu
Journal:  Heliyon       Date:  2020-04-04

Review 8.  World Health Organization declares global emergency: A review of the 2019 novel coronavirus (COVID-19).

Authors:  Catrin Sohrabi; Zaid Alsafi; Niamh O'Neill; Mehdi Khan; Ahmed Kerwan; Ahmed Al-Jabir; Christos Iosifidis; Riaz Agha
Journal:  Int J Surg       Date:  2020-02-26       Impact factor: 6.071

9.  Analysis and forecast of COVID-19 spreading in China, Italy and France.

Authors:  Duccio Fanelli; Francesco Piazza
Journal:  Chaos Solitons Fractals       Date:  2020-03-21       Impact factor: 5.944

10.  Computers and viral diseases. Preliminary bioinformatics studies on the design of a synthetic vaccine and a preventative peptidomimetic antagonist against the SARS-CoV-2 (2019-nCoV, COVID-19) coronavirus.

Authors:  B Robson
Journal:  Comput Biol Med       Date:  2020-02-26       Impact factor: 4.589

  10 in total
  33 in total

1.  Multicriteria group decision making via generalized trapezoidal intuitionistic fuzzy number-based novel similarity measure and its application to diverse COVID-19 scenarios.

Authors:  Palash Dutta; Gourangajit Borah
Journal:  Artif Intell Rev       Date:  2022-09-07       Impact factor: 9.588

Review 2.  Artificial intelligence for forecasting and diagnosing COVID-19 pandemic: A focused review.

Authors:  Carmela Comito; Clara Pizzuti
Journal:  Artif Intell Med       Date:  2022-03-28       Impact factor: 7.011

Review 3.  The COVID-19 epidemic analysis and diagnosis using deep learning: A systematic literature review and future directions.

Authors:  Arash Heidari; Nima Jafari Navimipour; Mehmet Unal; Shiva Toumaj
Journal:  Comput Biol Med       Date:  2021-12-14       Impact factor: 6.698

4.  Optimization using the firefly algorithm of ensemble neural networks with type-2 fuzzy integration for COVID-19 time series prediction.

Authors:  Patricia Melin; Daniela Sánchez; Julio Cesar Monica; Oscar Castillo
Journal:  Soft comput       Date:  2021-01-13       Impact factor: 3.732

5.  COVID-CheXNet: hybrid deep learning framework for identifying COVID-19 virus in chest X-rays images.

Authors:  Alaa S Al-Waisy; Shumoos Al-Fahdawi; Mazin Abed Mohammed; Karrar Hameed Abdulkareem; Salama A Mostafa; Mashael S Maashi; Muhammad Arif; Begonya Garcia-Zapirain
Journal:  Soft comput       Date:  2020-11-21       Impact factor: 3.732

6.  Examining the correlation between the weather conditions and COVID-19 pandemic in India: A mathematical evidence.

Authors:  Manash Jyoti Borah; Bipan Hazarika; Sumati Kumari Panda; Juan Jose Nieto
Journal:  Results Phys       Date:  2020-11-18       Impact factor: 4.476

7.  A topic-based hierarchical publish/subscribe messaging middleware for COVID-19 detection in X-ray image and its metadata.

Authors:  Süleyman Eken
Journal:  Soft comput       Date:  2020-10-19       Impact factor: 3.732

8.  A hybrid computational framework for intelligent inter-continent SARS-CoV-2 sub-strains characterization and prediction.

Authors:  Moses Effiong Ekpenyong; Mercy Ernest Edoho; Udoinyang Godwin Inyang; Faith-Michael Uzoka; Itemobong Samuel Ekaidem; Anietie Effiong Moses; Martins Ochubiojo Emeje; Youtchou Mirabeau Tatfeng; Ifiok James Udo; EnoAbasi Deborah Anwana; Oboso Edem Etim; Joseph Ikim Geoffery; Emmanuel Ambrose Dan
Journal:  Sci Rep       Date:  2021-07-15       Impact factor: 4.379

9.  Weather indicators and improving air quality in association with COVID-19 pandemic in India.

Authors:  Rabin Chakrabortty; Subodh Chandra Pal; Manoranjan Ghosh; Alireza Arabameri; Asish Saha; Paramita Roy; Biswajeet Pradhan; Ayan Mondal; Phuong Thao Thi Ngo; Indrajit Chowdhuri; Ali P Yunus; Mehebub Sahana; Sadhan Malik; Biswajit Das
Journal:  Soft comput       Date:  2021-07-13       Impact factor: 3.732

10.  A scalable framework for smart COVID surveillance in the workplace using Deep Neural Networks and cloud computing.

Authors:  Ajay Singh; Vaibhav Jindal; Rajinder Sandhu; Victor Chang
Journal:  Expert Syst       Date:  2021-05-06       Impact factor: 2.812

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.