Literature DB >> 34855909

Application of principal component analysis on temporal evolution of COVID-19.

Ashadun Nobi1, Kamrul Hasan Tuhin1, Jae Woo Lee2.   

Abstract

The COVID-19 is one of the worst pandemics in modern history. We applied principal component analysis (PCA) to the daily time series of the COVID-19 death cases and confirmed cases for the top 25 countries from April of 2020 to February of 2021. We calculated the eigenvalues and eigenvectors of the cross-correlation matrix of the changes in daily accumulated data over monthly time windows. The largest eigenvalue describes the overall evolution dynamics of the COVID-19 and indicates that evolution was faster in April of 2020 than in any other period. By using the first two PC coefficients, we can identify the group dynamics of the COVID-19 evolution. We observed groups under critical states in the loading plot and found that American and European countries are represented by strong clusters in the loading plot. The first PC plays an important role and the correlations (C1) between the normalized logarithmic changes in deaths or confirmed cases and the first PCs may be used as indicators of different phases of the COVID-19. By varying C1 over time, we identified different phases of the COVID-19 in the analyzed countries over the target time period.

Entities:  

Mesh:

Year:  2021        PMID: 34855909      PMCID: PMC8638895          DOI: 10.1371/journal.pone.0260899

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

The COVID-19 is an ongoing pandemic caused by the transmission of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) [1]. This viral disease began spreading rapidly in Wuhan, Hubei Province, China in December of 2019 [2-4]. The World Health Organization declared COVID-19 as a worldwide pandemic on March 11th, 2020 [5]. At that time, the cumulative number of infected individuals followed a low power growth trend [6]. By maintaining social distance and avoiding human-to-human contact, the spread of the virus can be reduced significantly [7, 8]. However, to limit the power growth of the infection, in addition to practicing social distancing, it is helpful to identify infected individuals and isolate them from others [6]. The governments of various countries have locked down their cities and to ensure proper social distancing among people. Different models have been applied to the time series of COVID-19 confirmed, death and recovered cases to observe and predict the COVID dynamics in different countries [9-18]. To understand COVID-19 dynamics, the susceptible exposed infectious removed model and bi-furcation analysis have been performed [9]. Fractional-order model is applied to test data which give a close forecasting to the real data [10]. Autoregressive model is applied on real world time series data to forecast the confirmed and recovered COVID-19 cases [11]. The dynamic evolution of COVID-19 in different countries is analyzed using network topology [12]. The maximum time span of the infection and annihilation of the diseases is predicted using susceptible, infectible, quarantined and confirmed recovered model based on only known data of confirmed and recovered cases [13]. Some studies have tried to combine similar countries into groups based on COVID-19 time series data by applying various clustering methods and principal component analysis (PCA) for better realization of its behavior [14-21]. Recently, factor analysis (FA) technique is used to categorize seven severely affected countries in basis of the cumulative counts of cases and deaths due to COVID-19 [14]. The relations between the spread rates of COVID-19 in high risk countries are determined by first Two PCs [15]. The first four principal components of PCA is used to investigate the cyclical patterns of the COVID-19 in different countries [16]. PCA is applied on data of 213 COVID-19 patients and identified three distinct groups of the COVID-19 patients [17]. Principal component analysis based unsupervised feature extraction is applied to RNA expression profiles of 16 COVID-19 patients and 18 healthy control subjects to identify genes associated with COVID-19 [18]. Autoregressive time series models based on two-piece scale mixture is applied on world time series of the confirmed and recovered cases of COVID-19 and forecast the confirmed and recovered cases of COVID-19 [19]. The time series model based on two-piece scale mixture normal distribution is used to predict the number of confirmed cases and death rate of COVID-19 in the world [20]. Fuzzy clustering method is applied on time series of COVID-19 in the high risk countries [21] and clustered the countries considering population’s size. Different methods were evaluated for forecasting the trajectories of COVID-19 using its previous data which are useful to determine the estimated number of infected individuals and number of deaths in advance [22-25]. In this article, we apply PCA to the correlations for the changes of cumulative time series of COVID-19 death and confirmed cases between different countries and characterize the evolving correlation structures within countries. By studying the temporal evolution of the correlations of the different affected countries, we uncover periods when there were major changes in the correlation structure of the global pandemic in the analyzed countries. We use the largest eigenvalues of the correlation matrices to observe the COVID-19 dynamics over time. Using loading plots of first two PC coefficients onto two dimensional spaces, we find different groups of the analyzed countries over time following the state of pandemic. We also find that countries make strong cluster when pandemic state is severe among the countries. Finally, the significant changes of diseases that occurred different countries are identified from the change of first PC coefficients. In this paper, we adopt a different approach to extract interesting and significant patterns from time series of COVID-19 in the affected countries. We measure the correlations from the fluctuations of COVID-19 cases and deaths among the top 25 countries such that one can imagine how fast the virus has been transmitted in the analyzed countries over the periods. To do so, we calculate the correlations for the logarithmic change of cumulative cases or deaths among the analyzed countries. To understand the correlations in more detail, we perform an eigenvalue analysis of the correlation matrix. We also perform eigenvectors analysis corresponding to first two eigenvalues. We calculate the PC coefficients which are correlations between PCs and normalized change of death or confirmed cases. The group dynamics of the affected countries is observed over time using first two PC coefficients not done before.

Methods

Let us define the logarithmic change in mortality caused by COVID-19 in country i as where N(t) is the cumulative number of deaths caused by COVID-19 in a country on day t. The normalized change in mortality for country i is defined as where σ is the standard deviation of the death in time series i over the time window and the symbol <⋯> denotes average over the time window. The size of the time window is a month. Next, we construct a normalized death matrix D for 25 countries with dimensions of N × T. The correlations of the changes in deaths among countries can be calculated as where D is the transpose of D. The correlation matrix C can be diagonalized in the following form: where P is the diagonal matrix with eigenvalues λ = (λ1, λ2,⋯,λ) in descending order and V is an orthogonal matrix of the corresponding eigenvectors. Each eigenvalue and the corresponding eigenvector can be written as where is known as ith principal component (PC) [26, 27]. The eigenvalues represent the variances of the data in the directions of the eigenvectors. A similar procedure was also performed for COVID-19 confirmed cases to calculate eigenvalues, eigenvectors, and PCs.

Data analysis

We analyzed the time series of daily COVID-19 cumulative confirmed cases and deaths from April of 2020 to February of 2021 in one month time windows. Data were collected from [28]. The daily logarithmic changes in COVID-19 death cases and confirmed cases were determined and PCA was applied to the correlation matrix of changes in COVID-19 death cases and confirmed cases for the top 25 countries. The countries are 1. United States (US), 2. India (IND), 3. Brazil (BRA), 4. United Kingdom (UK), 5. Russia (RUS), 6. France (FRA), 7. Spain (SPA), 8. Italy (ITA), 9. Turkey (TUR), 10. Germany (GER), 11. Colombia (COL), 12. Argentina (ARG), 13. Mexico (MEX), 14. Poland (POL), 15. Iran (IRN), 16. South Africa (SAF), 17. Ukraine (UKR), 18. Indonesia (INDO), 19. Peru (PER), 20. Czechia (CZE), 21. The Netherlands (NETH), 22. Canada (CAN), 23. Chile (CHI), 24. Portugal (POR), and 25. Romania (ROMA).

Eigenvalues and eigenvectors

We now describe the eigenvalues and corresponding eigenvectors of the correlation matrices of COVID-19 confirmed cases and deaths. We analyzed only the largest and second-largest eigenvalues and their corresponding eigenvectors. The time evolution of the first eigenvalue follows evolutionary dynamics because it exhibits sharp changes in some periods. The first eigenvalues represent the largest variances in the data, which can be used to identify severe effects of COVID-19. The dynamical changes in the first and second eigenvalues of COVID-19 confirmed cases and deaths over a one month time window are presented in Fig 1. The higher value of the largest eigenvalues suggests that affected countries are more similar in the state of the diseases. In Fig 1, one can see higher values for the first eigenvalues at the beginning of the COVID-19 outbreak, when the virus spread across the world during April of 2020. The largest eigenvalues for confirmed cases begin to decrease until June. Starting in July 2020, there is an increasing trend with small fluctuation for confirmed cases. However, the first eigenvalues for deaths are decreased up to August and then increases with fluctuation till February, 2021 which implies that deaths due to the effect of this disease is raised and correlated between the countries.
Fig 1

Changes in the first and second eigenvalues of COVID-19 deaths and confirmed cases.

The largest variances in the data can be observed at the beginning of COVID-19 during April and May.

Changes in the first and second eigenvalues of COVID-19 deaths and confirmed cases.

The largest variances in the data can be observed at the beginning of COVID-19 during April and May. When comparing the first eigenvalues of confirmed cases and deaths, one can see that at the beginning of COVID-19 from April to June of 2020, the COVID-19 dynamics of deaths are stronger than those of confirmed cases. After this period, the largest eigenvalue of deaths is lower than that of confirmed cases from July to November and the first eigenvalues are very similar in December. A sharp change in the COVID-19 dynamics of the deaths is observed again in February of 2021, resulting in a crossover with the COVID-19 confirmed cases. This indicates that COVID-19 death cases are more severe in this month. The second largest eigenvalues for deaths and confirmed cases increase from April to September and are approximately equal to the first eigenvalue of deaths in September. Later, both eigenvalues decrease in October and then increase again starting in November. However, for deaths, it becomes steady till January. When comparing the second eigenvalue of COVID-19 confirmed cases to deaths, one can see that the value of confirmed cases is greater than that that of deaths for most of the target period. We now consider the components of the first and second eigenvectors of the correlation matrix of COVID-19 confirmed cases and deaths. The components of the first eigenvector corresponding to the largest eigenvalue of the top 25 severe countries over time are shown in Fig 2(A). We found that the components of the eigenvector correspond to the largest eigenvalue and second largest eigenvalue change sign with evolution of time. We took the absolute value of eigenvectors, which is shown by heat map, because we want how much the components of eigenvector of these countries dominate on the first and second eigenvector. We are not concerned about the sign ambiguity of the eigenvectors. The components of the first eigenvector of COVID-19 confirmed cases and deaths are non-uniform in the target countries. The significant components of this eigenvector for deaths are greater than those of confirmed cases. These components fluctuate significantly with the intensity of diseases and their trends over time are dissimilar between countries. This is reasonable because the severity of diseases do not peak at the same time in different countries.
Fig 2

Components of eigenvectors (a) the first eigenvector for death cases, (b) second eigenvector for death cases, (c) the first eigenvector for confirmed cases, and (d) the second eigenvector for confirmed cases. The horizontal axis shows the month of the last data point of each time window. The vertical axis represents the affected countries. The maroon shaded areas indicate dominating eigenvectors of different affected countries.

Components of eigenvectors (a) the first eigenvector for death cases, (b) second eigenvector for death cases, (c) the first eigenvector for confirmed cases, and (d) the second eigenvector for confirmed cases. The horizontal axis shows the month of the last data point of each time window. The vertical axis represents the affected countries. The maroon shaded areas indicate dominating eigenvectors of different affected countries. The most influential components of the first eigenvector of deaths appear for US, Brazil, Russia, UK, Italy, Mexico, and Iran in different periods from June to November, as indicated by the maroon shaded area in Fig 2(A) and 2(C). The least influential components appear for the bottom ten countries, as indicated by the blue shaded areas. The components of the first eigenvector for US deaths are significant in June, August, November, and January, whereas for Brazil, these components are dominant from June to August. The components of the eigenvectors for confirmed cases and deaths in India are the most significant from August to October. The dominance of the eigenvectors for UK and Russia can be observed in different periods from June to November, whereas for Mexico, the greater values appear in July and August. The most influential components of the first eigenvector for Italy appear from September to January. It is noteworthy that there are no significant components of the first eigenvectors for France and Spain, despite COVID-19 causing many deaths in these countries. The most influential components of first eigenvector for COVID-19 confirmed cases can be observed for most countries in different periods from May to April shown in Fig 2(C). The components of the eigenvector corresponding to the second largest eigenvalue are presented in Fig 2(B) and 2(D). One can see that the components of the eigenvector of the second eigenvalue of COVID-19 confirmed cases and deaths are much greater that the components of the first eigenvector, but only contribute significantly in few periods, as indicated by the yellow and red shaded areas. The second eigenvector of deaths is more significant than that of confirmed cases. The components of deaths in the US and South Africa are the most dominant in October and April, respectively. Significant components can be identified for most countries in different periods for both confirmed cases and deaths. The most influential components of confirmed cases can be observed for the countries of Brazil, France, Spain, South Africa, Ukraine, Czech Republic, and Portugal. However, other countries also contribute moderately to the second eigenvector in different periods.

Group dynamics of COVID-19 countries

We now consider the group dynamics on the plane of the first PC (PC1) and the second PC (PC2) as shown in Fig 3. The correlations between PCs and normalized change of death or confirmed cases known as PC coefficients. We use the PC coefficients to measure the strengths of the countries and PCs relationship and to determine which country contributes more on two PCs. The more the contribution of a country on PCs, the more the severity of the diseases is. We projected the first two PC coefficients into two-dimensional spaces to see where the countries are located, which should provide additional information regarding the severity of diseases. The maximum variances of the data in the direction of the largest eigenvector are defined by first PC, while the second PC represents the variances in the direction of the second eigenvector. We selected the first two PCs because the variances explained by the first two PCs are significant for evaluating the severity of COVID-19. Loading plots of the first two PC coefficients for a one month time window are presented in Fig 3.
Fig 3

Scatter plots of PC1 and PC2 coefficients of the top 25 countries affected by COVID-19 (a) during April 2020 for confirmed cases, (b) April 2020 for death, (c) May 2020 for confirmed cases, (d) May 2020 for death, (e) October 2020 for confirmed cases, (f) October 2020 for death, (g) February for confirmed cases, and (h) February for deaths. We showed only the periods where group dynamic show significant meaning. Countries make strong cluster in the loading plots when the states of the diseases are similar and severe among the countries.

Scatter plots of PC1 and PC2 coefficients of the top 25 countries affected by COVID-19 (a) during April 2020 for confirmed cases, (b) April 2020 for death, (c) May 2020 for confirmed cases, (d) May 2020 for death, (e) October 2020 for confirmed cases, (f) October 2020 for death, (g) February for confirmed cases, and (h) February for deaths. We showed only the periods where group dynamic show significant meaning. Countries make strong cluster in the loading plots when the states of the diseases are similar and severe among the countries. Based on the closest points in the two-dimensional load plots, we identified different groups of countries over time. We observe that changes in the time window altered the groups and clusters. This could be a result of the varying severities of the disease in different countries. We observed that the numbers of clusters and countries contained in a cluster for COVID-19 confirmed cases were not similar to those for COVID-19 deaths. We will explain the group dynamics starting from April of 2020, when the virus began to spread across the world. Almost all countries are in the same cluster in April for confirmed cases. However, when we look for the clusters of COVID-19 deaths, there are two big groups. The cluster that is far from the origin and near to each other indicated by dotted ellipse labeled as ‘critical group’ in Fig 3(A) contains the countries of the US, UK, Italy, Span, France, Turkey, Germany, Iran, Canada, Czech Republic, Netherland, and Poland. Most of these countries are American and European. This indicates that these countries make the largest contributions to the two PCs. This also indicates that there were critical conditions for these countries caused by COVID-19 in April of 2020. The countries in the second-largest cluster are Brazil, Argentina, India, Peru, and Russia. The conditions in these countries were moderate during April. The other countries are widely scattered. Regarding the largest clusters in May for COVID-19 confirm cases, three big clusters can be observed, where the largest cluster contains the countries of the US, Canada, Mexico, UK, Netherland, Italy, Romania, Turkey, India. These countries are far from the origin and close to each other, implying a similar state of diseases because they contribute equally to the two PCs. However, the other two clusters are closer to the origin and far from each other, implying the smaller contribution to the two PCs which may be caused by a moderate state of COVID-19 confirmed cases. However, one can see a different scenario for COVID-19 deaths in this month. Excluding Argentina, Chile, Portugal, Indonesia, South Africa, all countries are in the same cluster and we can conclude that the COVID-19 death scenario is almost the same in these countries. In October, more than ten countries form one big cluster for COVID-19 confirmed cases and there are also other small clusters. However, the points are far from each other overall, indicating varying states of COVID-19. The critical groups split into two and three large clusters in October for COVID-19 confirmed cases and deaths, as shown in Fig 3(E) and 3(F), respectively. The Countries in the large two clusters during October for confirmed cases and deaths are European, implying that the European countries are in a similar situation. The American countries are in the other largest cluster for death group. Finally, in February, the countries form three and four large clusters for COVID-19 confirmed cases and deaths, respectively, and one can see that the countries of some groups are near to each other and far from the origin, indicating that some countries faced severe diseases scenarios. The group that is far from the origin for COVID-19 confirm cases contains the countries of the US, UK, Canada, Russia, Colombia, Portugal, Indonesia, South Africa, which represent the worst scenarios for COVID-19. Mexico, Germany, and Argentina form another cluster and there is another large cluster shown in Fig 3(G). For COVID-19 deaths, there are two critical groups and the points are close to each other, containing the countries of America, Asia and Europe shown in Fig 3(H). During this period, Iran, France, Span and Argentina form a group near to origin, implying that the contribution of these countries to the two PCs represent good scenarios of COVID-19. Almost all countries contribute equally to the first two PCs at the beginning of diseases progression and form a strong cluster. However, countries form different clusters over time based on the intensification of diseases because some countries reduce disease severity through lockdown or other countermeasures, resulting in various clusters. Countries that are far from origin and close to each other imply equal contributions to the two PCs and we identify these countries as critical groups. Countries in critical groups are heavily affected by the intensification of diseases. Therefore, we can identify the countries in which COVID-19 is more severe.

Pandemic states of countries

We now consider the correlation C1(d1, y1) between the normalized logarithmic changes of death or confirmed cases and PC1. The change of correlations C1 over time can be used to identify the COVID-19 states of different countries. Because the largest eigenvalues represent the strongest COVID-19 dynamics, the principal PCs denoted by y are appropriate for observing COVID-19 states over time. The greatest variations in the data are represented by the first PCs. The temporal changes in the first PCs are also used in finance to identify the states of different markets [17, 18]. We identified the COVID-19 states of countries based on the changes in the correlations of each country contributing to PC1 between two monthly time windows such as ΔC1 = C1(T+1)−C1(T) where T is the time period. The changes in ΔC1 for COVID-19 confirmed cases and deaths are visualized by the heat map shown in Fig 4. Changes in color indicate changes in the state of COVID-19 in a country.
Fig 4

Time varying change of PC for (a) COVID-19 death cases and (b) confirmed cases. The change of color indicates the change of the states of the diseases.

Time varying change of PC for (a) COVID-19 death cases and (b) confirmed cases. The change of color indicates the change of the states of the diseases. Severe states of COVID-19 are identified by red shaded areas, while deep blue shaded areas represent moderate states. Minor changes in COVID-19 state are identified by the yellow color. Excluding the countries of Iran, Chile, and Romania, we found that there were no changes in the states of COVID-19 deaths from April to July in the top countries. A significant change in the COVID-19 state can be observed in August for most countries, implying a moderate state of COVID-19 during this period. In September, severe state of COVID-19 can be observed in some countries such as the UK, Russia, Germany, and Iran. This pattern changes in October excluding the state in India. Most countries enter a severe state of COVID-19 again in November and return to normal state in December. Severe states of COVID-19 deaths can be observed again in January and February in most countries. During these periods, a milder state of COVID-19 death can be observed in Spain, Argentina, Indonesia, Peru, and Chile. Large changes of COVID-19 confirmed cases cannot be observed for all countries in same periods, as shown in Fig 4(B), which may be a result of lockdowns. Lockdown policies vary from country to country. A significant change in the first PCs can be observed for the US and Russia in June and July, respectively. Some other countries such as Iran, Mexico, Argentina also exhibit state changes from April to July. In September, the mild states of some countries can be observed in Fig 4(B), as indicated by the yellow shaded areas. Small changes in COVID-19 confirmed cases can be observed in most of the countries from September to December, with a severe state returning in December in almost all countries. However, unlike COVID-19 death cases, a moderate state of COVID-19 confirm cases can be observed in February in almost all countries. Additional research is required to analyze this phenomenon, which we will leave for future investigations. The changes in states of COVID-19 deaths and confirmed cases did not occur for all countries in the same period. In particular, changes in the state of COVID-19 confirm cases can be observed for various countries in different periods from April to December. This may be a result of lockdown policies. However, the changes in states in most countries follow a similar pattern from December to February. Time series of COVID-19 is an experimental data and researchers have been used different methods to extract information from the data. PCA is an important technique to find out the relation among the affected countries and to give forecast for the transmission of diseases near future. Mahmoudi et al., classify seven severely affected countries into two classes using two factors of FA technique projected onto two dimensional spaces [14]. They also showed the relations between the high risk countries projecting first two PCs onto two dimensional spaces [15]. Using first four PCs, Duarate et al., showed cyclical patterns of COVID-19 over time for various countries. The authors also showed that the incidence curves of the COVID-19 had strong similarities among countries and can predict the trend of cycles near future. Our study finds that the largest eigenvalues of the correlation matrix in the monthly time frame can be used to observe the dynamics of COVID-19 across the world. Since the time evolution of the largest eigenvalue shows the sharp change in a particular period. For example, the sharp rise of the largest eigenvalue in January for confirmed cases imply that the COVID-19 can be worsen near future. The eigenvectors corresponding to the largest eigenvalue are an indication of changes in the correlation matrix. The components of the eigenvector for the largest eigenvalue show non-uniform composition, which imply that some countries contribute it significantly. The non-uniform composition of the components of the first eigenvector can classify the countries in the basis of the severity of the diseases. For example, the significant components of the first eigenvector in August are found for US, Brazil, Russia, Mexico which indicate critical period of the disease in these countries. Moreover, the graphical approach in Fig 2(A) indicate that the components of the first eigenvector are changing over time which identify different periods of the diseases. For example, in case of Brazil, the severe state of the disease was found from June to August while normal state was in September and October. Similarly, we can identify different states of the diseases for other countries using the components of the first eigenvector. We also classify the countries over time using the first two PCs coefficients and observe the similarities of the countries. The time evolution of COVID-19 can be helpful to understand the states and dynamics of the pandemic and to make policy for the future.

Conclusion

The technique of principal component analysis were used to investigate the time evolving correlation structures of the top 25 affected countries and to study the states of the diseases in these countries. We observed that the diseases dynamics were faster in the beginning of COVID-19 outbreak. During these early periods, COVID-19 deaths were increased faster than confirmed cases. The dynamics of deaths again increased from September with fluctuation and reached at peak in February 2021. The components of the second eigenvector were more significant than the components of the first eigenvector for some countries in some periods. We also studied the time evolving relationship between different countries by investigating the correlations between the cumulative change of confirmed and deaths by first two PCs. We projected the first two PC coefficients onto two dimensional spaces and found strong clusters in particular periods indicating similar state of the diseases in these countries. The groups that were far from the origin and contained points close to each other can be considered similar and also severe state of the diseases. The first PC contained the largest variances of the data and the correlation C1 between the cumulative change of confirmed and deaths with first PCs is considered to identify the COVID states of the different countries. Using time varying C1, we identify the periods of a country when it enters in severe state of the diseases and come back to normal state. When we look for the change of C1 for death cases, we found that American and European countries were in severe state of the diseases from September to November 2020. We found that the states of COVID-19 confirmed cases are different than those of COVID-19 deaths and that countries change their states over time. We seem that PCA on time evolving correlations for the change of cumulative COVID-19 cases of the affected countries all over the world can be used to understand the states and dynamics of COVID-19. We can propose the similar strategic policies to overcome the spreading disease such as the pharmaceutical and nonpharmaceutical methods. Because the dynamics of the spreading disease is changing over time, the PCA depends on the analyzing period. Therefore, the strategic policies are also changing over time. In further work, we will apply feature ranking of machine learning on time series data to find out the relation among the countries and to predict the state of the diseases. More details, we will leave for our next article. 20 Sep 2021 PONE-D-21-24857Application of principal component analysis on temporal evolution of COVID-19PLOS ONE Dear Dr. Lee, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.The manuscript requires further several revisions regarding study novelty and contribution to the related domain, related literature, quantitative framework and discussion, as well as future research directions. Please submit your revised manuscript by Nov 04 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript: A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Stefan Cristian Gherghina, PhD. Habil. Academic Editor PLOS ONE Journal requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf 2. We suggest you thoroughly copyedit your manuscript for language usage, spelling, and grammar. If you do not know anyone who can help you do this, you may wish to consider employing a professional scientific editing service. Whilst you may use any professional scientific editing service of your choice, PLOS has partnered with both American Journal Experts (AJE) and Editage to provide discounted services to PLOS authors. Both organizations have experience helping authors meet PLOS guidelines and can provide language editing, translation, manuscript formatting, and figure formatting to ensure your manuscript meets our submission guidelines. To take advantage of our partnership with AJE, visit the AJE website (http://aje.com/go/plos) for a 15% discount off AJE services. To take advantage of our partnership with Editage, visit the Editage website (www.editage.com) and enter referral code PLOSEDIT for a 15% discount off Editage services.  If the PLOS editorial team finds any language issues in text that either AJE or Editage has edited, the service provider will re-edit the text for free. Upon resubmission, please provide the following: The name of the colleague or the details of the professional service that edited your manuscript A copy of your manuscript showing your changes by either highlighting them or using track changes (uploaded as a *supporting information* file) A clean copy of the edited manuscript (uploaded as the new *manuscript* file) 3. We note that the grant information you provided in the ‘Funding Information’ and ‘Financial Disclosure’ sections do not match. When you resubmit, please ensure that you provide the correct grant numbers for the awards you received for your study in the ‘Funding Information’ section. 4. Thank you for stating the following in the Acknowledgments Section of your manuscript: “This work was supported by the Ministry of Education of the Republic of Korea and the National Research Foundation of Korea (NRF-2019S1A5C2A03081234).” We note that you have provided additional information within the Acknowledgements Section that is not currently declared in your Funding Statement. Please note that funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form. Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows: “The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.” Please include your amended statements within your cover letter; we will change the online submission form on your behalf. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: No ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: No ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: This is an interesting contribution to the existing literature, but the paper suffers from several shortcomings listed in the following comments. - The paper should be checked by a native. - A discussion section should be added. - The introduction should be updated by recent researches. - The novelty and contribution should be clearly bolded. - The authors should consider some works about Data Analysis that can be applied to model different datasets. For example, - The authors should consider some works about Data Analysis that can be applied to model Covid Datasets. For example, Time series modelling to forecast the confirmed and recovered cases of COVID-19, Travel medicine and infectious disease 37, 101742. Modeling and forecasting the spread and death rate of coronavirus (COVID-19) in the world using time series models, Chaos, Solitons & Fractals 140, 110151. Principal component analysis to study the relations between the spread rates of COVID-19 in high risks countries, Alexandria Engineering Journal 60 (1), 457-464. Fuzzy clustering method to compare the spread rate of Covid-19 in the high risks countries, Chaos, Solitons & Fractals 140, 110230. Factor analysis approach to classify COVID-19 datasets in several regions, Results in Physics 25, 104071. It’s better to suggest some subjects for future works. Best regards, Reviewer #2: Outstanding work. PCA in the context of time series is a reliable method to identify similarities between countries in terms of severity and to anticipate new surges. I would suggest to contrast their findings with similar studies in the discussion and to suggest ways to use their analysis in a practical way to forecast new surges and outbreaks. I suggest the authors consider contrasting the study with one produced by our group: Duarte P, Riveros E. Understanding the cycles of COVID-19 incidence: Principal Component Analysis and interaction of biological and socio-economic factors. Annals of Medicine and Surgery. 2021 Jun 1:102437. Reviewer #3: I failed to identify some points as follows: 1. What is the hypothesis and problem that you address in this paper? 2. Does only PCA enough to explain this complex issue as the scientist are striving to know the behavior of the virus and its mutation. 3. What could be the implications of your study in epidemiology? 4. Is that only focus to check the applicability of PCA methods? 5. `By studying the temporal evolution of the correlations of the different affected countries, we uncover the periods when there were major changes in the correlation structure of the global pandemic in the analyzed countries.` to know this country wise data is enough I think what new things can be added by this research? is unclear. 6. The variation in confirmed cases and deaths among different countries is obvious it need not to say after PCA. 7. Why the dynamic trend is changing is not explored here. Multiple factors might be in fact certainly involved from governance to economy. Overall, I did not find any novel outcome from this study. Thus, I am not recommending its publication in this journal. ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No Reviewer #3: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. 21 Oct 2021 Thank you for your fruitful comments. Reviewer #1: This is an interesting contribution to the existing literature, but the paper suffers from several shortcomings listed in the following comments. - The paper should be checked by a native. Ans: A native person checked the paper. - A discussion section should be added. Ans: We added the discussion in the manuscript. Please see the last paragraph of result analysis. - The introduction should be updated by recent researches. Ans: The introduction is updated by recent articles. We added some sentences in the third paragraph of Introduction part. - The novelty and contribution should be clearly bolded. Ans: The novelty is given in the last paragraph of introduction part. - The authors should consider some works about Data Analysis that can be applied to model different datasets. Ans: We add the related articles and update our work. We add new references [14-21]. Reviewer #2: Outstanding work. PCA in the context of time series is a reliable method to identify similarities between countries in terms of severity and to anticipate new surges. I would suggest to contrast their findings with similar studies in the discussion and to suggest ways to use their analysis in a practical way to forecast new surges and outbreaks. I suggest the authors consider contrasting the study with one produced by our group: Duarte P, Riveros E. Understanding the cycles of COVID-19 incidence: Principal Component Analysis and interaction of biological and socio-economic factors. Annals of Medicine and Surgery. 2021 Jun 1:102437. Ans: We added the discussion in the last paragraph of result analysis. We add the reference in [16]. Reviewer #3: I failed to identify some points as follows: 1. What is the hypothesis and problem that you address in this paper? Ans: We analyze the time series of COVID-19 to extract information about relations and the dynamics of COVID-19 in the top 25 countries. We want to investigate the dynamical similarity among countries. The similarity of strategic policies should be influenced to control the COVID-19. We gave some points in last paragraph of Introduction. 2. Does only PCA enough to explain this complex issue as the scientist are striving to know the behavior of the virus and its mutation. Ans: Of course, only PCA is not enough to explain these issues. However, we believe that the PCA is one of the powerful methods to understand the dynamics of disease spreading. Genome sequence analysis is necessary to know the behavior of the virus and its mutation. However, genome sequence analysis is critical in understanding the COVID-19 and preventing the spread of the disease. Different techniques such as PCA, Machine learning, artificial neural network can be applied to analyze genome sequence of COVID-19. We will apply these techniques on gene data for further investigation. We include some methods to explain the dynamical features of COVID-19 in second paragraph of Introduction part. 3. What could be the implications of your study in epidemiology? Ans: The transmission of diseases in the severe affected countries can be understood by using PCA. From the PCA, we can observe the strong correlation of the dynamical change for the spreading disease. We can propose the similar strategic policies to those countries. I include these points in last paragraph in Conclusion part 4. Is that only focus to check the applicability of PCA methods? Ans: Actually, we apply PCA technique to the correlations for the change of COVID-19 cases and death among the severe affected countries to understand the states and dynamic of COVID-19 in the analyzed countries. This analysis is also helpful to determine the relationships among these countries. In future, we will apply feature ranking of machine learning on gene data to understand the behavior of virus and its mutation. 5. `By studying the temporal evolution of the correlations of the different affected countries, we uncover the periods when there were major changes in the correlation structure of the global pandemic in the analyzed countries.` to know this country wise data is enough I think what new things can be added by this research? is unclear. Ans: PCA is a technique to reduce data dimension. Applying PCA on correlation structure, we try to show that only first two PCs are enough to observe the changes of the states in the analyzed countries and to make relationships among these countries. We gave some comments in the last paragraph in section . 6. The variation in confirmed cases and deaths among different countries is obvious it need not to say after PCA. Ans: Yes. You are right. We wanted to identify the periods to observe the COVID-19 states in the analyzed countries by heat map so that we can make a comparative study among the countries in the basis of severity. 7. Why the dynamic trend is changing is not explored here. Multiple factors might be in fact certainly involved from governance to economy. Ans: The change of correlation structure among the countries is identified by the largest eigenvalue and is used to observe the dynamics of COVID-19. The correlation structure for the change of COVID-19 cases and deaths among the countries is altered due to lock down, making social distance, hard immunity and for other policies and consequently, the dynamic trend is changed. Some articles considered other factors like socio-economic factors in new added Ref [16]. In the future study, we will consider other possible factors. Submitted filename: jwl_rebuttal.docx Click here for additional data file. 19 Nov 2021 Application of principal component analysis on temporal evolution of COVID-19 PONE-D-21-24857R1 Dear Dr. Lee, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Stefan Cristian Gherghina, PhD. Habil. Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #1: All comments have been addressed ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: (No Response) ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No 23 Nov 2021 PONE-D-21-24857R1 Application of principal component analysis on temporal evolution of COVID-19 Dear Dr. Lee: I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. If we can help with anything else, please email us at plosone@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Stefan Cristian Gherghina Academic Editor PLOS ONE
  18 in total

1.  Nonlinear dynamic analysis of an epidemiological model for COVID-19 including public behavior and government action.

Authors:  C A K Kwuimy; Foad Nazari; Xun Jiao; Pejman Rohani; C Nataraj
Journal:  Nonlinear Dyn       Date:  2020-07-16       Impact factor: 5.022

2.  Time series modelling to forecast the confirmed and recovered cases of COVID-19.

Authors:  Mohsen Maleki; Mohammad Reza Mahmoudi; Darren Wraith; Kim-Hung Pho
Journal:  Travel Med Infect Dis       Date:  2020-05-13       Impact factor: 6.211

3.  Cluster-based dual evolution for multivariate time series: Analyzing COVID-19.

Authors:  Nick James; Max Menzies
Journal:  Chaos       Date:  2020-06       Impact factor: 3.642

4.  Strong correlations between power-law growth of COVID-19 in four continents and the inefficiency of soft quarantine strategies.

Authors:  Cesar Manchein; Eduardo L Brugnago; Rafael M da Silva; Carlos F O Mendes; Marcus W Beims
Journal:  Chaos       Date:  2020-04       Impact factor: 3.642

5.  The outbreak of COVID-19: An overview.

Authors:  Yi-Chi Wu; Ching-Sung Chen; Yu-Jiun Chan
Journal:  J Chin Med Assoc       Date:  2020-03       Impact factor: 2.743

6.  Propagation analysis and prediction of the COVID-19.

Authors:  Lixiang Li; Zihang Yang; Zhongkai Dang; Cui Meng; Jingze Huang; Haotian Meng; Deyu Wang; Guanhua Chen; Jiaxuan Zhang; Haipeng Peng; Yiming Shao
Journal:  Infect Dis Model       Date:  2020-03-31

7.  Outbreak of pneumonia of unknown etiology in Wuhan, China: The mystery and the miracle.

Authors:  Hongzhou Lu; Charles W Stratton; Yi-Wei Tang
Journal:  J Med Virol       Date:  2020-02-12       Impact factor: 2.327

Review 8.  The SARS-CoV-2 outbreak: What we know.

Authors:  Di Wu; Tiantian Wu; Qun Liu; Zhicong Yang
Journal:  Int J Infect Dis       Date:  2020-03-12       Impact factor: 3.623

9.  Understanding the cycles of COVID-19 incidence: Principal Component Analysis and interaction of biological and socio-economic factors.

Authors:  Pablo Duarte; Efrain Riveros-Perez
Journal:  Ann Med Surg (Lond)       Date:  2021-06-01
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.