| Literature DB >> 35816483 |
Enrique Canessa1, Livio Tenze1.
Abstract
We apply the new GenomeBits method to uncover underlying genomic features of omicron and delta coronavirus variants. This is a statistical algorithm whose salient feature is to map the nucleotide bases into a finite alternating (±) sum series of distributed terms of binary (0,1) indicators. We show how by this method, distinctive signals can be uncovered out of the intrinsic data organization of amino acid progressions along their base positions. Results reveal a sort of 'ordered' (or constant) to 'disordered' (or peaked) transition around the coronavirus S-spike protein region. Together with our previous results for past variants of coronavirus: Alpha, Beta, Gamma, Epsilon and Eta, we conclude that the mapping into GenomeBits strands of omicron and delta variants can help to characterize mutant pathogens.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35816483 PMCID: PMC9273097 DOI: 10.1371/journal.pone.0271039
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Fig 1Similarity plots.
Upper curves: genetic similarity curves between the query sequence SARS-CoV-2 Wuhan-Hu-1 and representative delta and omicron complete genome sequences. In clear blue is the genomic region encoding the spike (S-protein). Lower curves: delta genome sequences used as query against omicron data from Spain and USA. A typical sliding 1000 base pair window in steps of 100 nucleotide bases position was used in these calculations.
Example of genome sequence-to-GenomeBits mapping (via Eq (1) for N = 12).
| Base position | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | GenomeBits sums |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Sequence (string) | A | G | A | T | C | T | G | T | T | C | T | C |
|
| (−1) | +1 | 0 | +1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| (−1) | 0 | 0 | 0 | 0 | +1 | 0 | 0 | 0 | 0 | -1 | 0 | -1 | -1 |
| (−1) | 0 | -1 | 0 | 0 | 0 | 0 | +1 | 0 | 0 | 0 | 0 | 0 | 0 |
| (−1) | 0 | 0 | 0 | -1 | 0 | -1 | 0 | -1 | +1 | 0 | +1 | 0 | -1 |
The variable α represents single-letter nucleotide codes: (A)denine, (C)ytosine, (G)uanine and (T)hymine. Within the GenomeBits method, the ± signs are chosen sequentially starting with plus at the nucleotide base position k = 1 by default.
Fig 2Sequence sum series.
Delta (in blue) and omicron (in green) variant imprints displayed by the nucleotides A,C,G,T according to Eq (1) along different samples of the genomic strand of coronavirus available from Spain and USA. The arrows indicate a sort of ‘ordered’ (constant) to ‘disordered’ (peaked) transition before the coding region of the S-spike genes for the SARS-CoV-2 Wuhan-Hu-1 sequence (drawn in clear blue).