| Literature DB >> 35717436 |
Luryane F Souza1,2, Tarcísio M Rocha Filho3, Marcelo A Moret4,5.
Abstract
We classify the main variants of the SARS-CoV-2 virus representing a given biological sequence coded as a symbolic digital sequence and by its evolution by a cellular automata with a properly chosen rule. The spike protein, common to all variants of the SARS-CoV-2 virus, is then by the picture of the cellular automaton evolution yielding a visible representation of important features of the protein. We use information theory Hamming distance between different stages of the evolution of the cellular automaton for seven variants relative to the original Wuhan/China virus. We show that our approach allows to classify and group variants with common ancestors and same mutations. Although being a simpler method, it can be used as an alternative for building phylogenetic trees.Entities:
Mesh:
Year: 2022 PMID: 35717436 PMCID: PMC9206224 DOI: 10.1038/s41598-022-14404-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Coding for each of the 20 possible amino acids[28].
| Amino acids | K | N | D | E | P | Q | R |
|---|---|---|---|---|---|---|---|
| Decimal code | 6 | 8 | 9 | 10 | 11 | 12 | 13 |
| Binary code | 00110 | 01000 | 01001 | 01010 | 01011 | 01100 | 01101 |
Figure 1Rule 184 from[36] for an elementary cell automaton with three neighbors. The state 0 is represented in white and 1 in black.
Figure 2Evolution of protein cellular automata from coding in from Table 1 and the Wolfram’s rule in Fig. 1, for the different variants. The horizontyal and vertical axes are the cell number i and the evolution step t, respectively.
Figure 3Left: Hamming distance as a function of step t for the time evolution of the cellular automata associated to the spike protein between each variant and the initial Wuhan strain. Right: Zoom over the initial values of t.
Mutations the Spike protein of the SARS-CoV-2 variants from[39].
| Variant | Mutations |
|---|---|
| Alpha | HV69-70del, Y145del, N501Y, A570D, D614G, P681H, T716I, S982A, D1118H |
| Beta | L18F, D80A, D215G, R246I, K417N, E484K, N501Y, D614G, A701V |
| Gamma | L18F, T20N, P26S, HV69-70del, D138Y, Y145H, R190S, K417T, E484K, N501Y, D614G, H655Y, T1027I, V1176F |
| Delta | T95I, G142D, E154K, L452R, E484Q, D614G, P681R, Q1071H |
| B.1.1.28 | HV69-70del, Y145del, D614G, V1176F |
| P2 | HV69-70del, Y145del, E484K,D614G,V1176F |
| Omicron | A67V, HV69-70del, T95I, G142D, VYY143-145del, N211I, L212del, G339D, S371L, S373P, S375F, K417N, N440K, G446S, S477N, T478K, E484A, Q493R, G496S, Q498R, N501Y, Y505H, T547K, D614G, H655Y, N679K, P681H, N764K, D796Y, N856K, Q954H, N969K, L981F |
Figure 4Phylogenetic tree of SARS-CoV-2 variants and the Wuhan strain sequences from the neighbor-joining method[43].