Literature DB >> 35816483

GenomeBits insight into omicron and delta variants of coronavirus pathogen.

Abstract

We apply the new GenomeBits method to uncover underlying genomic features of omicron and delta coronavirus variants. This is a statistical algorithm whose salient feature is to map the nucleotide bases into a finite alternating (±) sum series of distributed terms of binary (0,1) indicators. We show how by this method, distinctive signals can be uncovered out of the intrinsic data organization of amino acid progressions along their base positions. Results reveal a sort of 'ordered' (or constant) to 'disordered' (or peaked) transition around the coronavirus S-spike protein region. Together with our previous results for past variants of coronavirus: Alpha, Beta, Gamma, Epsilon and Eta, we conclude that the mapping into GenomeBits strands of omicron and delta variants can help to characterize mutant pathogens.

Entities: Chemical

Mesh：

Substances：

Year: 2022 PMID： 35816483 PMCID： PMC9273097 DOI： 10.1371/journal.pone.0271039

Source DB: PubMed Journal: PLoS One ISSN： 1932-6203 Impact factor: 3.752

Introduction

Since the coronavirus outbreak in Wuhan, China, in December 2019, the SARS-CoV-2 pandemic has became a major risk in global public health. The impact of the outbreak on access to healthcare services has left important repercussions. Severe effects on the mental health and well-being of medical staff and people around the world have had also a lot of relevant implications [1]. Many other sectors such as world economic systems have also suffered significantly due to the induced Covid-19 restrictions. Understanding the coronavirus pathogen is still a global challenge for scientific research. The identification by Similarity studies and fast genomic analysis of the positive-stranded RNA virus –continuously provided through the complete genome sequences confirmed from different laboratories around the world, allowed to shed light into the evolutionary origins of SARS-CoV-2 lineage [2]. These studies suggested that the SARS-CoV-2 genome may be in fact formed via recombination of genomes close to the RaTG13 and GD Pangolin CoV genomes, and be a close relative of bat CoV ZC45 and ZXC21. However, the intermediate source species of SARS-CoV-2 have not been confirmed so far (for a updated preview see Ref [3]). The importance of understanding the origin of this coronavirus, natural or unnatural, will help to prevent possible future pandemics. This debate, however, highlights the need for a global network of real-time surveillance systems, with the capacity to rapidly deploy genomic tools and statistical studies for pathogen identification and characterization of the evolution of potential lineages and mutations. SARS-CoV-2 is still spreading worldwide with evolving variants, some of which occur in the Spike protein and appear to increase viral infection [4]. The recent study in Ref [4], emphasizes the need to track and analyze viral sequences also in relation to clinical status. To this last aim we consider in this work, and apply anew, the GenomeBits method [5]. This is a simple statistical algorithm whose salient feature is to map the nucleotide bases A,C,G,T into a finite alternating (±) sum series of distributed terms of binary (0,1) indicators. The method can provide additional information to conventional comparative Similarity studies via alignment methods [6], specially on single nucleotide structures to detect the effects of mutations. That is, within the single nucleotide polymorphisms of a genome sequence. These polymorphisms reveal the complex dynamic gene transfer-recombination events among lineages with potential to cause major human outbreaks [7]. We report here a kind of ‘ordered’ (or constant) to ‘disordered’ (or peaked) transition around the S-spike protein region. Previously by GenomeBits [5], we uncovered distinctive signals for the intrinsic gene insights in the coronavirus genome sequences for past variants of concern: Alpha, Beta and Gamma, and the variants of interest: Epsilon and Eta. We enhance here this previous study of GenomeBits into the most recent delta and omicron variants of the coronavirus pathogen. The first known confirmed delta variant was detected in India in late 2020 and the B.1.1.529 infection appeared from a specimen collected in South Africa a year later, early November 2021 [8, 9]. Omicron variants have grown dominant world-wide and seems to be continuously evolving. Evolution selects those mutations that replicate more efficiently. The delta and omicron variants share some parts of their structures [10, 11]. Both variants have common mutations, i.e., amino acid changes in the building blocks that conform the spike protein (D614G mutation), and different mutations elsewhere (the P323L mutation in the NSP12 polymerase and the C241U nucleotide mutation in the 5’ untranslated region). The latter seems to give a greater advantage for their replicating capacity. Omicron seems to cause less severe COVID-19 than delta as seen from data on duration of hospital stays, ICU admittance and deaths which are reported lower than during previous pandemic waves [12]. The ongoing SARS-CoV-2 research is currently focusing on understanding the essential functions of the conforming proteins in the ribonucleic acid RNA coronaviruses [13]. These consists of an unusually large collection of RNA synthesizing and processing enzymes to express and replicate genome sequences that are targets for antiviral drug design. One possibility to unravel genome organization and variants of viruses is through statistical approaches as shown by the present results.

Comparison methods

Genome sequence comparisons and the discovery of new signaling pathways can be analyzed using the GenomeBits method [5, 14]. We apply GenomeBits to uncover distinctive patterns from delta and omicron genome sequences available in the GISAID archive at www.gisaid.org. As discussed below, this method provides additional information to conventional approaches via Similarity comparisons (for a comparison with Fourier Power Spectrum studies see [5]).

Similarity plots

Similarity between pairs of full-length genome sequences is the standard method for determining whether there are sequence equivalences in terms of shared ancestry between them by using alignment methods [6, 15]. We show in Fig 1, genetic Similarity plots of different query sequences of SARS-CoV-2 genome. We downloaded genome sequence data in FASTA format collected in December 2021 (whose GISAID accession IDs are also indicated in the figures). The FASTA format is the commonly used text-based format for representing nucleotide sequences in which amino acids are represented by single-letter codes: (A)denine, (C)ytosine, (G)uanine and (T)hymine (or Uracil RNA genome for single strand folded onto itself). They store the instructions to assemble and reproduce every living organism.

Fig 1

Similarity plots.

Similarity plots.

Upper curves: genetic similarity curves between the query sequence SARS-CoV-2 Wuhan-Hu-1 and representative delta and omicron complete genome sequences. In clear blue is the genomic region encoding the spike (S-protein). Lower curves: delta genome sequences used as query against omicron data from Spain and USA. A typical sliding 1000 base pair window in steps of 100 nucleotide bases position was used in these calculations. By the Similarity plot, we verified more deviations of omicron variants than delta variants from Spain and USA with respect to the first Wuhan-China sequences identified over a year ago (MN908947) [16]. The lower curves in Fig 1 display larger deviations of delta variants from Spain (EPI ISL 7900475 and 7951710) against omicron variants from Spain, as well as the delta variant from USA (EPI ISL 8179449) against omicron variants from USA. In these calculations we used “lalign36” sequence comparison software via the Waterman–Eggert algorithm at http://github.com/wrpearson/fasta36 [5]. Conventional Similarity comparisons via “lalign36” alignment provides limited information on the single nucleotide bases A,C,G,T. To determine the best parameters to achieve optimal alignments is difficult. There are several user-defined parameters to overcome gaps and mismatches usually found between genome sequences. Furthermore, the computational resources required increase considerably depending on the length and number of sequences to be aligned. In the figure regions with clustering of SARS-CoV-2 sequences (< 1%) from the city of Wuhan in relation to the delta lineages from SPAIN, suggest some genetic similarities outside the S-spike gene region (bp 21563–25384, colored in clear blue). More divergent genetic similarities (∼ 97%) are found between the first Wuhan-China sequences and omicron strains from Spain and USA, and in between the delta sequences from Spain and USA against omicron variants from Spain and USA, respectively. This is a clear consequence of the coronavirus mutations.

GenomeBits representation

Our new quantitative method for the examination of distinctive patterns of complete genome sequences considers a certain type of alternating series having terms converted to (0,1) binary values for the nucleotide variables α = A, C, T, G as observed along the reported genome sequences, namely where the individual terms X are associated with 0 or 1 values according to their position along the genome sequences of length N, satisfying the following relation The arithmetic progression carries positive and negative signs (−1) and a finite non-zero first moment of the independently distributed variables X. The mapping into four binary projections of genome sequences follows previous studies on the three-base periodicity characteristic of protein-coding DNA sequences [17]. However, as a principal difference with other binary representations (see also, e.g., Refs [18, 19]), in our mapping the terms in the sums change sign. If a term X is positive at a given nucleotide base position (bp) k, then the successive X term is negative and vice versa. This selection is inspired by the well-know model of Ising spins model in Physics in which discrete variables representing magnetic dipole moments of atomic spins in a lattice can be in one of two states: +1 or −1. In our case the binary indicator for the sequences carries alternating ± signs where plus and minus signs are chosen sequentially starting with +1 at k = 1 as illustrated in the example of Table 1. In the Table a mapping example for converting the brief genome fragment AGATCTGTTCTC (consisting of 12 nucleotides) into the alternating binary array via Eq (1) is given.

Table 1

Example of genome sequence-to-GenomeBits mapping (via Eq (1) for N = 12).

Base position k	1	2	3	4	5	6	7	8	9	10	11	12	GenomeBits sums
Sequence (string)	A	G	A	T	C	T	G	T	T	C	T	C	∑k=112(-1)k-1Xα,k
(−1)^k−1 X_α=A,k	+1	0	+1	0	0	0	0	0	0	0	0	0	2
(−1)^k−1 X_α=C,k	0	0	0	0	+1	0	0	0	0	-1	0	-1	-1
(−1)^k−1 X_α=G,k	0	-1	0	0	0	0	+1	0	0	0	0	0	0
(−1)^k−1 X_α=T,k	0	0	0	-1	0	-1	0	-1	+1	0	+1	0	-1

The variable α represents single-letter nucleotide codes: (A)denine, (C)ytosine, (G)uanine and (T)hymine. Within the GenomeBits method, the ± signs are chosen sequentially starting with plus at the nucleotide base position k = 1 by default. There is a user-friendly Graphics User Interface (GUI) to the present signal analysis method of genome sequences. The GUI runs under Linux Ubuntu O.S. and can be downloaded freely from Github [14]. It is efficient and requires little processing time for large genomic data. Our GenomeBits GUI considers samples with A,C,T,G sequences for (up to two) given Countries corresponding to genomic sequence data from (up to six) given variants/species. It discards uncompleted sequences containing codification errors (usually denoted with “NNNNN” and other letters). In brief, with just one click the GenomeBits GUI allows to run the alternating sums in Eq (1) for up to six-times-two inputs of FASTA files containing (i.e., concatenating) more than one genome sequence each; separate concatenated genome sequences and save it in single FASTA files (for each country); get into single files, each of the four nucleotide bases represented by the symbols A,C,T,G; get the alternating sums results in single files for each of the four nucleotide bases A,C,T,G associated with (±) binary values; plot the alternating sums to compare behavior of pairs A,T and C,G nucleotide bases versus nucleotide bp; compare in a plot alternating sums curves versus bp for all four nucleotide bases A,C,T,G; plot the alternating sums curves versus bp for each of the four nucleotide bases A,C,T,G; plot results for each nucleotide base A,C,T or G for up to six variants/species and up to 4 FASTA files by country; comparison of GenomeBits GUI curves for (up to six) given variants/species and (up to two) selected Countries, with those results from our original paper in [5]. We shall show next that analyzing genomic sequencing via the present type of finite alternating sums allows to extract unique features for omicron and delta mutations with little data noise variations. From the viewpoint of statistics, such series are equivalent to a discrete-valued time series for the statistical identification and characterization of (random) data sets [20].

Discussion

The GenomeBits representation of coronavirus genome variants, by adding binary values with ± signs following Eq (1), can reveal interesting imprints of the genome dynamics at the level of nucleotide ordering. By this method of binary projections we are able to uncover distinctive signals of the intrinsic gene organization embedded in the genome sequences of the single-stranded RNA coronaviruses. This approach allow to judge nucleotide bases mutations between omicron and delta genome sequences by simply alternating their associated (±) binary coding as illustrated in Table 1. In Fig 2, we show the results obtained for the sequences of each A, C, G and T nucleotide of the coronavirus variants of concern—AY.4.2 (delta) and B.1.1529 (omicron) reported from Spain and the USA for a number of representative samples as indicated. As reference, we display on the left of the plot our results for nucleotides A,C of the strand and on the right the nucleotides T,G (“complementary to those of the opposite strand”—according to the pairing rules A-T and C-G of DNA). The complete genome sequences consists of N nucleotides on the order of 30,000 base pairs in length, two to three times larger than that of most other RNA viruses [13].

Fig 2

Sequence sum series.

Sequence sum series.

Delta (in blue) and omicron (in green) variant imprints displayed by the nucleotides A,C,G,T according to Eq (1) along different samples of the genomic strand of coronavirus available from Spain and USA. The arrows indicate a sort of ‘ordered’ (constant) to ‘disordered’ (peaked) transition before the coding region of the S-spike genes for the SARS-CoV-2 Wuhan-Hu-1 sequence (drawn in clear blue). It is interesting to note how in the figure there are regions where the curves for the delta variant (in blue) mirror those of the omicron variant (in green). This peculiar behavior becomes clear by averaging both curves as shown by the red lines. The regions of almost null (with low data noise), or rather constant average values, indicates rather perfect mirroring matching, which is driven by the ± signs of the alternating series. This reveals coding regions of correspondence between delta and omicron variants. The regions of main discrepancies as found in the Similarity identities curves of Fig 1, e.g., around N = 10000 are also reflected by the red lines of Fig 2. The main difference between both comparative genomic approaches is that changes via Eq (1) can be analyzed and characterized at each single A,C,G,T nucleotide level. This is important because single nucleotide polymorphisms are the most common genomic variations. The figure illustrate a kind of ‘ordered’ (constant) to ‘disordered’ (peaked) transition phenomena around the NSP-5 polymerase within the open reading frames ORF1a region [10, 11], up to the nucleotide region of the S-Protein (colored clear blue area). To some degree, there are also other distinctive trends especially around the S-Protein. As seen in Fig 2, the black arrows indicate a phase transition point appearing close to the coding region of the S-spike genes. The peaked curves diverge rapidly and tend to separate denoting bigger dissimilarities for increasing N. It is worth noting that the patterns for the base sequence series for Adenine and Cytosine display completely different convergences between the variants considered. The positive and negative terms in the sums of Eq (1) for the discrete α variables, partly cancel out allowing the series “to converge” to some non-zero values for all the nucleotide classes. At the black arrows positions, this feature allows to estimate a separation ratio of ∼ 2 between nucleotide C and G, and ∼ 4 times higher between the curves for A and T. We also note that the inclusion of different smoothing sliding window sizes of up to about 500 bp (moving along the target genome sequences and repeating the GenomeBits procedure as described, lead to a data noise reduction in the curves and preserve the average behavior of the sums displayed in Figs 1 and 2.

Conclusions

We have applied the GenomeBits method Eq (1) to uncover underlying genomic features of omicron and delta coronavirus variants. By this method, we have found a sort of ‘ordered’ (or constant) to ‘disordered’ (or peaked) transition around the coronavirus S-spike protein region. This result is significant because it has been obtained by assigning binary strings to symbolic nucleotide characters. Via this simple assignment, one can compare various gene sequences as done so far. Consequently, the present nucleotide mapping representation may be useful to archive the dynamical (history) properties of the original sequence. Numerical representations of genome sequences have gained great attention in bioinformatics studies. One advantage for this approach is that large sequence data can be handled statistically to find various characterizations. Additional properties of the genome sequences for mutant pathogens, as derived in this work for an ‘ordered’-to-‘disordered’ transition, may also allow to locate and distinguish polymers of amino acids (proteins states and positions) in a sequence and determine if the altered genes may behave similar to those already targeted. GenomeBits may shed light on the bioinformatics surveillance behind future infectious diseases. By a comparison of numerical results, it may be also of some relevance to assist in further developments of synthetic mRNA-based vaccine designs [13, 21]. Such comparative genomic statistical representations can offer insights on the inherent data organization during the natural evolution of pandemic. Letter sequence-to-numerical signal mappings are likely to continue in future genomic encodings of new sequences. (TEX) Click here for additional data file.

Transfer Alert

This paper was transferred from another journal. As a result, its full editorial history (including decision letters, peer reviews and author responses) may not be present. 27 Apr 2022

PONE-D-22-01925

GenomeBits insight into omicron and delta variants of coronavirus pathogen

PLOS ONE Dear Dr. Canessa, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Jun 11 2022 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Vladimir Makarenkov Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf Additional Editor Comments: It would be important to add a discussion (preferably to the Introduction section) about the origins and impact of the SARS-Cov-2 pandemic. Here you could cite the following works: Boni, Maciej F., et al. "Evolutionary origins of the SARS-CoV-2 sarbecovirus lineage responsible for the COVID-19 pandemic." Nature microbiology 5.11 (2020): 1408-1417. Makarenkov, V., Mazoure, B., Rabusseau, G. et al. Horizontal gene transfer and recombination analysis of SARS-CoV-2 genes helps discover its close relatives and shed light on its origin. BMC Ecol Evo 21, 5 (2021). Domingo JL. What we know and what we need to know about the origin of SARS-CoV-2. Environ Res. 2021;200:111785. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Partly Reviewer #2: Partly ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: N/A Reviewer #2: Yes ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: No Reviewer #2: No ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: 1. The write-up of the manuscript seems to be premature and very difficult to understand. There are several sweeping statements without any references. For instance, the statement “The first known confirmed delta variant was detected in India in late 2020 and the B.1.1.529 infection appeared from a specimen collected in South Africa a year later, early November 2021” should have a citation. Several sentences are incomplete or do not have a meaning. For example “Although omicron seems to cause less severe COVID-19 than delta.” 2. Authors should clearly explain the computation of binary scores for a given genomic sequence. Illustration or outline graphic may be required. 3. It is trivial that a mononucleotide composition can reveal similar information. How Genomebit information is different from mononucleotide computation with a sliding window?. 4. There is no discussion on their results. The authors should compare their analysis with their previous work and also with similar research by others. Reviewer #2: The manuscript on the topic GenomeBits insight into omicron and delta variants of coronavirus pathogen is an interesting research article. The manuscript is with the interest to the reader and fully in the scope of journal. I will suggest the manuscript to be accepted for publication after revision. 1. Abstract section looks incomplete. I will suggest the author to focus on following important points on writing the abstract. An abstract summarizes, usually in one paragraph of 300 words or less, the major aspects of the entire paper in a prescribed sequence that includes: 1) the overall purpose of the study and the research problem(s) you investigated; 2) the basic design of the study; 3) major findings or trends found as a result of your analysis; and, 4) a brief summary of your interpretations and conclusions. 2. English of the script is very poorly written. Please write your text in good English (American or British usage is accepted, but not a mixture of these). English language manuscript may require editing to eliminate possible grammatical or spelling errors and to conform to correct scientific English. 3. There are several sentences in the script which are really hard to understand. I will suggest the authors should carefully read the script and amend the English language correction throughout the script. 4. Introduction section need to be more elaborated. 5. Provide a general interpretation of the results in the context of other evidence, and implications for future research. ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: Yes: Dr Kishore Sesham Reviewer #2: Yes: SADANAND PANDEY [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. 13 May 2022 See also attached file "Response_to_Reviewers.pdf" for our reply to each specific reviewer and editor comments. Response to Reviewers of PONE-D-22-01925 “GenomeBits insight into omicron and delta variants of coronavirus pathogen”, by E. Canessa, L. Tenze We are indebted to the academic editor and the two reviewers for providing insightful comments on the manuscript. Our responds to each point raised are as follows: �  Reply to Additional Editor Comments: The Introduction section was revised and completely rewritten. In particular, we added a discussion in the Introduction section about the origins of the SARSCov- 2 pandemic as sugested by the Editor, and cited the three works suggested: (i) Boni, Maciej F., et al. (ii) Makarenkov, V., Mazoure, B., Rabusseau, G. et al., and (iii) Domingo JL. We also briefly mentioned the impact of the SARS-Cov-2 pandemic and added 11 new references to support our findings throughout the text. The abstract and conclusions have also been improved based on the data presented. All sequence data analysed here are publicly available at GSIAD and all custom code used in the manuscript is fully available at https://github.com/canessae/GenomeBits/ We have made an effort to present our manuscript in an intelligible fashion with an standard English. At revision, we corrected few typographical and grammatical errors. �  Reply to Reviewer #1: 1. We have made an effort to rewrite our manuscript and present it in an intelligible fashion with an standard English. Our statements now carry out associated new references and few sentences have been completed with a clear meaning. 2. We have added a new Table to aIllustrate the computation of binary scores for a given genomic sequence. This Table helps us to explain that in our case, the mapping into four binary projections of genome sequences carries alternating plus and minus signs, where + and - signs are chosen sequentially starting with +1 at k = 1 as default. The Table shows a particular mapping 1 example for converting the brief fragment AGATCTGTTCTC of 12 nucleotides into the alternating binary array. As a principal difference with previous binary representations (see, e.g., new Refs [19, 20]), our mapping (whose terms change sign –i.e., if a term X_k is positive then X_k+1 is negative and vice versa) displays their occurrence as +1 or -1 as well as their non existence as 0 at a given base pair k. 3. The inclusion of different smoothing sliding window sizes of up to about 500 bp (moving along the target genome sequences and repeating the GenomeBits procedure as described, lead to a data noise reduction in the curves and preserve the average behavior of the sums displayed in Figures 1 and 2. 4. Our results have been further elaborated. We emphasize that the regions of main discrepancies as found in the Similarity identities curves of Fig 1, e.g., around N=10000 are also reflected by the red lines of Fig 2 via GenomeBits. The main difference between both comparative genomics approaches is that changes via Eq (1) can be analyzed and characterized at each single A,C,G,T nucleotide level separately. Beside such comparision of Similarity studies and our previous analysis on Fast Fourier Transforms for coronavirus genome of other variants as reported in Ref[6], our present findings are new and have not similars to those reported by others. We report a kind of 'ordered' (constant) to 'disordered' (peaked) phase transition phenomena around the NSP5 polymerase within the open reading frames ORF1a region, up to the nucleotide region of the S-Protein. As seen in Fig 2, the black arrows indicate an analogous phase transition point appearing close to the coding region of the Sspike genes. �  Reply to Reviewer #2: We particularly thanks this referee for finding our research article interesting and fully in the scope of PLOS One journal. 1. As suggested, the new Abstract now reads: “We apply the new GenomeBits method to uncover underlying genomic features of omicron and delta 2 coronavirus variants. This is a statistical algorithm whose salient feature is to map the nucleotide bases into a finite alternating (+-) sum series of distributed terms of binary (0,1) indicators. We show how by this method, distinctive signals can be uncovered out of the intrinsic data organization of amino acid progressions along their base positions. Results show a sort of 'ordered' (or constant) to 'disordered' (or peaked) transition around the coronavirus S-spike protein region. Together with our previous results for past variants of coronavirus: Alpha, Beta, Gamma, Epsilon and Eta, we conclude that the mapping into GenomeBits strands of omicron and delta variants can help to characterize mutant pathogens.” 2. We have made an effort to present our manuscript in an intelligible fashion with an standard English. At revision, we corrected few typographical and grammatical errors. 3. (idem as 2) 4. As also replied to the Academic Editor, the Introduction section was revised and completely rewritten. In particular, we added a discussion in the Introduction section about the origins of the SARS-Cov-2 pandemic and cited several new references. The abstract and conclusions have also been improved based on the data presented 5. In the Conclusion section we now provide a general reinterpretation of the results in the context of the present study and other evidence published, and also briefly discussed implications for future research. -oOo- Submitted filename: Response_to_Reviewers.pdf Click here for additional data file. 23 Jun 2022 GenomeBits insight into omicron and delta variants of coronavirus pathogen PONE-D-22-01925R1 Dear Dr. Canessa, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Vladimir Makarenkov Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #2: All comments have been addressed ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #2: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #2: N/A ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #2: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #2: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #2: (No Response) ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #2: Yes: SADANAND PANDEY ********** 1 Jul 2022 PONE-D-22-01925R1 GenomeBits insight into omicron and delta variants of coronavirus pathogen Dear Dr. Canessa: I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. If we can help with anything else, please email us at plosone@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Vladimir Makarenkov Academic Editor PLOS ONE

15 in total

1. Evolution of long-range fractal correlations and 1/f noise in DNA base sequences.

Authors:
Journal: Phys Rev Lett Date: 1992-06-22 Impact factor: 9.161

2. Omicron and Delta variant of SARS-CoV-2: A comparative computational study of spike protein.

Authors: Suresh Kumar; Thiviya S Thambiraja; Kalimuthu Karuppanan; Gunasekaran Subramaniam
Journal: J Med Virol Date: 2021-12-27 Impact factor: 2.327

3. Challenges in Inferring Intrinsic Severity of the SARS-CoV-2 Omicron Variant.

Authors: Roby P Bhattacharyya; William P Hanage
Journal: N Engl J Med Date: 2022-02-02 Impact factor: 91.245

4. A pneumonia outbreak associated with a new coronavirus of probable bat origin.

Authors: Peng Zhou; Xing-Lou Yang; Xian-Guang Wang; Ben Hu; Lei Zhang; Wei Zhang; Hao-Rui Si; Yan Zhu; Bei Li; Chao-Lin Huang; Hui-Dong Chen; Jing Chen; Yun Luo; Hua Guo; Ren-Di Jiang; Mei-Qin Liu; Ying Chen; Xu-Rui Shen; Xi Wang; Xiao-Shuang Zheng; Kai Zhao; Quan-Jiao Chen; Fei Deng; Lin-Lin Liu; Bing Yan; Fa-Xian Zhan; Yan-Yi Wang; Geng-Fu Xiao; Zheng-Li Shi
Journal: Nature Date: 2020-02-03 Impact factor: 69.504

5. Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding.

Authors: Roujian Lu; Xiang Zhao; Juan Li; Peihua Niu; Bo Yang; Honglong Wu; Wenling Wang; Hao Song; Baoying Huang; Na Zhu; Yuhai Bi; Xuejun Ma; Faxian Zhan; Liang Wang; Tao Hu; Hong Zhou; Zhenhong Hu; Weimin Zhou; Li Zhao; Jing Chen; Yao Meng; Ji Wang; Yang Lin; Jianying Yuan; Zhihao Xie; Jinmin Ma; William J Liu; Dayan Wang; Wenbo Xu; Edward C Holmes; George F Gao; Guizhen Wu; Weijun Chen; Weifeng Shi; Wenjie Tan
Journal: Lancet Date: 2020-01-30 Impact factor: 79.321

6. Horizontal gene transfer and recombination analysis of SARS-CoV-2 genes helps discover its close relatives and shed light on its origin.

Authors: Vladimir Makarenkov; Bogdan Mazoure; Guillaume Rabusseau; Pierre Legendre
Journal: BMC Ecol Evol Date: 2021-01-21

Review 7. Structures and functions of coronavirus replication-transcription complexes and their relevance for SARS-CoV-2 drug design.

Authors: Brandon Malone; Nadya Urakova; Eric J Snijder; Elizabeth A Campbell
Journal: Nat Rev Mol Cell Biol Date: 2021-11-25 Impact factor: 113.915

8. The impact of the COVID-19 pandemic on the mental health of medical staff considering the interplay of pandemic burden and psychosocial resources-A rapid systematic review.

Authors: Julian Hannemann; Alan Abdalrahman; Yesim Erim; Eva Morawa; Lucia Jerg-Bretzke; Petra Beschoner; Franziska Geiser; Nina Hiebel; Kerstin Weidner; Susann Steudte-Schmiedgen; Christian Albus
Journal: PLoS One Date: 2022-02-22 Impact factor: 3.240

9. Discovery of a rich gene pool of bat SARS-related coronaviruses provides new insights into the origin of SARS coronavirus.

Authors: Ben Hu; Lei-Ping Zeng; Xing-Lou Yang; Xing-Yi Ge; Wei Zhang; Bei Li; Jia-Zheng Xie; Xu-Rui Shen; Yun-Zhi Zhang; Ning Wang; Dong-Sheng Luo; Xiao-Shuang Zheng; Mei-Niang Wang; Peter Daszak; Lin-Fa Wang; Jie Cui; Zheng-Li Shi
Journal: PLoS Pathog Date: 2017-11-30 Impact factor: 6.823

10. Uncovering Signals from the Coronavirus Genome.

Authors: Enrique Canessa
Journal: Genes (Basel) Date: 2021-06-25 Impact factor: 4.096