Sk Sarif Hassan, Pabitra Pal Choudhury, Bidyut Roy.
Abstract
The envelope (E) protein is one of the structural viroporins (76-109 amino acids) present in coronaviruses. Sixteen distinct E protein sequences were observed among a total of 4917 complete genomes available in the NCBI database as of 18 June 2020. Missense mutations in the envelope protein across various coronaviruses of the β-genus were analyzed to identify the immediate parental origin of the envelope protein of SARS-CoV2. This evolutionary origin is also supported by phylogenetic analysis of the envelope proteins, comparing sequence homology as well as amino acid conservation.
Keywords: Amino acid conservation; COVID-19; Envelope protein; Phylogeny; SARS-CoV2; Viroporin
Year: 2020 PMID: 32927009 PMCID: PMC7486180 DOI: 10.1016/j.ygeno.2020.09.014
Source DB: PubMed Journal: Genomics ISSN: 0888-7543 Impact factor: 5.736
Fig. 1. Domains of the envelope protein of β-CoVs.
Envelope protein of different host-CoVs.
| Host | Total | Distinct | % of Variability of the E protein |
|---|---|---|---|
| Bat | 79 | 25 | 31.646% |
| Camel | 269 | 9 | 3.346% |
| Cat | 42 | 17 | 40.476% |
| Cattle | 22 | 2 | 9.090% |
| Pangolin | 1 | 1 | 0% |
| Chimpanzee | 1 | 1 | 0% |
| SARS-CoV2 | 4917 | 19 | 0.3864% |
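The variability column above is consistent with the ratio of distinct to total sequences, expressed as a percentage. The following is a minimal Python sketch under that assumption; the function names are illustrative and do not come from the paper. The single-sequence hosts (Pangolin, Chimpanzee) are listed as 0% in the table, which this ratio does not reproduce, so those rows were presumably handled separately.

```python
def count_distinct(sequences):
    """Return the number of distinct sequences in a collection."""
    return len(set(sequences))


def variability_percent(total, distinct):
    """Percentage of distinct sequences among all sequences (assumed formula)."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * distinct / total


if __name__ == "__main__":
    # (total, distinct) pairs copied from the table above
    table = {"Bat": (79, 25), "Camel": (269, 9), "Cat": (42, 17), "SARS-CoV2": (4917, 19)}
    for host, (total, distinct) in table.items():
        print(host, round(variability_percent(total, distinct), 4))
```

For example, `variability_percent(79, 25)` rounds to 31.646, matching the Bat row.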
List of distinct envelope (E) proteins from different host CoVs and their respective protein ID.
| Protein ID | Host | Protein ID | Host | Protein ID | Host |
|---|---|---|---|---|---|
| AIA62357 | Bat-CoV | AIA62302 | Bat-CoV | ADO39821 | Feline-CoV |
| AIA62348 | Bat-CoV | AVP78044 | Bat-CoV | ACT10858 | Feline-CoV |
| AHY61342 | Bat-CoV | AVI15004 | Bovine-CoV | ACT10869 | Feline-CoV |
| ASL68958 | Bat-CoV | AVZ61113 | Bovine-CoV | ACT10909 | Feline-CoV |
| ASL68947 | Bat-CoV | ALA50082 | Camel-CoV | ACT10941 | Feline-CoV |
| ATQ39391 | Bat-CoV | QCI31474 | Camel-CoV | ACT10974 | Feline-CoV |
| AUM60029 | Bat-CoV | QBM11741 | Camel-CoV | ACT10920 | Feline-CoV |
| QDF43841 | Bat-CoV | ASU89926 | Camel-CoV | AWW13513 | Chimpanzee-CoV |
| YP_009072442 | Bat-CoV | ASU90554 | Camel-CoV | QIG55947 | Pangolin-CoV |
| YP_009273007 | Bat-CoV | ANI69894 | Camel-CoV | QHZ00381 | Human-SARS-CoV-2 |
| ABD75324 | Bat-CoV | ALA49346 | Camel-CoV | QKI36855 | Human-SARS-CoV-2 |
| AGC74167 | Bat-CoV | ALA49390 | Camel-CoV | QKG87268 | Human-SARS-CoV-2 |
| AKZ19089 | Bat-CoV | ASU90334 | Camel-CoV | QKE45838 | Human-SARS-CoV-2 |
| ADK66843 | Bat-CoV | QDM36990 | Feline-CoV | QJR88103 | Human-SARS-CoV-2 |
| QDF43816 | Bat-CoV | AYF53097 | Feline-CoV | YP_009724392 | Human-SARS-CoV-2 |
| ATO98160 | Bat-CoV | AXE71624 | Feline-CoV | QKI36831 | Human-SARS-CoV-2 |
| ATO98184 | Bat-CoV | ASU62492 | Feline-CoV | QJS53352 | Human-SARS-CoV-2 |
| QDF43821 | Bat-CoV | ASU62503 | Feline-CoV | QJA42107 | Human-SARS-CoV-2 |
| ATO98135 | Bat-CoV | AUG98123 | Feline-CoV | QJQ84210 | Human-SARS-CoV-2 |
| AHX37560 | Bat-CoV | AMD11134 | Feline-CoV | QJR89447 | Human-SARS-CoV-2 |
| AIA62280 | Bat-CoV | AGT52084 | Feline-CoV | QJI54124 | Human-SARS-CoV-2 |
| ABD75313 | Bat-CoV | AEK25514 | Feline-CoV | QKU31207 | Human-SARS-CoV-2 |
| AIA62312 | Bat-CoV | AEK25525 | Feline-CoV | QKU37035 | Human-SARS-CoV-2 |
| | | | | QKV07065 | Human-SARS-CoV-2 |
| | | | | QKU32371 | Human-SARS-CoV-2 |
| | | | | QKU28584 | Human-SARS-CoV-2 |
| | | | | QKU52835 | Human-SARS-CoV-2 |
| | | | | QKV06741 | Human-SARS-CoV-2 |
Amino acid residues and their respective colors and properties used in Fig. 2.
| Residues | Color | Property |
|---|---|---|
| A, V, F, P, M, I, L and W | RED | Hydrophobic (including aromatic, excluding Y) |
| D and E | BLUE | Acidic |
| R and K | MAGENTA | Basic (excluding H) |
| S, T, Y, H, C, N, G and Q | GREEN | Hydroxyl + sulfhydryl + amine + G |
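The color scheme above can be expressed as a simple lookup from residue to color group. This is a minimal sketch; the dictionary and function names are illustrative, and only the residue groupings themselves are taken from the table.

```python
# Residue groups copied from the color table above.
COLOR_GROUPS = {
    "RED": set("AVFPMILW"),      # hydrophobic
    "BLUE": set("DE"),           # acidic
    "MAGENTA": set("RK"),        # basic
    "GREEN": set("STYHCNGQ"),    # hydroxyl, sulfhydryl, amine, and G
}


def residue_color(residue):
    """Return the color group of a one-letter amino acid code."""
    for color, residues in COLOR_GROUPS.items():
        if residue in residues:
            return color
    raise ValueError(f"unknown residue: {residue!r}")


def color_sequence(sequence):
    """Return the color group of every residue in a sequence."""
    return [residue_color(r) for r in sequence]


if __name__ == "__main__":
    print(color_sequence("MYSFVSEETG"))
```

The four groups together cover all 20 standard amino acids, so any standard protein sequence can be colored without gaps.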
Fig. 2. Sequence alignment of the E protein of Bat CoV.
Missense mutations in the envelope protein of the Bat CoV.
| Protein ID | Mutation | Domain |
|---|---|---|
| ATQ39391, AUM60029, AHY61342, ASL68958, ASL68947, AIA62357, AIA62348 | Y2L | N-terminal |
| YP_009072442, AUM60029 | E7Q | N-terminal |
| QDF43841 | E7A | N-terminal |
| YP_009273007 | E7T | N-terminal |
| AIA62348, ASL68947, AIA62357, ASL68958, AHY61342, AUM60029, ATQ39391 | E8Q | N-terminal |
| QDF43841, YP_009273007 | E8D | N-terminal |
| AIA62348, ASL68947, AIA62357, ASL68958, AHY61342, AUM60029, ATQ39391 | T9I | N-terminal |
| AIA62348 | T11A | N-terminal |
| QDF43841, YP_009273007 | T11V | N-terminal |
| AIA62348 | F20S | TMD |
| ASL68947, AIA62357, ASL68958, AHY61342, AUM60029, ATQ39391 | F20T | TMD |
| YP_009072442 | A22G | TMD |
| AIA62348, ASL68947, AIA62357, ASL68958, AHY61342, AUM60029, ATQ39391, QDF43841, YP_009273007 | F23C | TMD |
| YP_009273007 | V25C | TMD |
| AIA62348, ASL68947, AIA62357, ASL68958, ATQ39391 | F26T | TMD |
| AKZ19089, YP_009072442 | T30A | TMD |
| AIA62348, ASL68947, AIA62357, ASL68958, AHY61342, AUM60029, ATQ39391 | T30C | TMD |
| QDF43841, YP_009273007 | T30G | TMD |
| QDF43841, YP_009273007 | L31C | TMD |
| QDF43841, YP_009273007 | T35L | TMD |
| YP_009072442 | A36C | TMD |
| ASL68947, AIA62357, ASL68958, AHY61342, AUM60029, ATQ39391 | L37T | C-terminal |
| QDF43841 | C40V | C-terminal |
| YP_009273007 | C40I | C-terminal |
| ASL68947, ASL68958 | A41M | C-terminal |
| AIA62348 | C44V | C-terminal |
| AIA62357, ASL68958, AHY61342, AUM60029 | C44A | C-terminal |
| ATQ39391 | C44I | C-terminal |
| AIA62357, ASL68958, AHY61342 | N45I | C-terminal |
| AUM60029 | N45V | C-terminal |
| AIA62348, ASL68947, AIA62357, ASL68958, AHY61342, AUM60029, ATQ39391 | I46G | C-terminal |
| YP_009072442, AUM60029 | I46C | C-terminal |
| AIA62348 | V47C | C-terminal |
| YP_009072442 | N48D | C-terminal |
| YP_009072442, AUM60029 | N48F | C-terminal |
| YP_009072442 | V49Q | C-terminal |
| AIA62348, ASL68947, AIA62357, ASL68958 | V49T | C-terminal |
| QDF43841 | V49N | C-terminal |
| AIA62348, ASL68947, AIA62357, ASL68958, AHY61342, AUM60029, ATQ39391 | S50L | C-terminal |
| QDF43841 | S50I | C-terminal |
| QDF43841, YP_009273007 | V52C | C-terminal |
| AIA62348 | K53L | C-terminal |
| AIA62357 | K53V | C-terminal |
| YP_009072442 | V56R | C-terminal |
| QDF43841 | Y57L | C-terminal |
| YP_009072442 | S60L | C-terminal |
| ASL68958 | S60I | C-terminal |
| YP_009072442 | R61Q | C-terminal |
| AIA62348, ASL68947, AIA62357, ASL68958, AHY61342, AUM60029, ATQ39391 | R61T | C-terminal |
| AIA62348, ASL68947, AIA62357, ASL68958, AHY61342, AUM60029, ATQ39391 | V62G | C-terminal |
| YP_009072442 | K63Q | C-terminal |
| YP_009072442 | N64A | C-terminal |
| YP_009273007 | L65D | C-terminal |
| QDF43841 | L65E | C-terminal |
| AIA62348, ASL68947, ASL68958, AHY61342, AUM60029, ATQ39391 | S67V | C-terminal |
| AIA62357 | S67F | C-terminal |
| QDF43841, YP_009273007 | S67L | C-terminal |
| ATO98160, AIA62280 | S68A | C-terminal |
| YP_009072442, AIA62348, ASL68947, AIA62357, ASL68958, AHY61342, AUM60029, ATQ39391 | S68K | C-terminal |
| QDF43841, YP_009273007 | S68L | C-terminal |
| AGC74167 | E69V | C-terminal |
| ATO98135 | E69Q | C-terminal |
| AVP78044 | E69R | C-terminal |
| YP_009072442 | E69L | C-terminal |
| AIA62348, ASL68947, ASL68958, AHY61342, AUM60029, ATQ39391 | E69F | C-terminal |
| QDF43841, YP_009273007 | E69N | C-terminal |
| QDF43841, YP_009273007 | G70E | C-terminal |
| AIA62348, ASL68947, ASL68958, AHY61342, AUM60029, ATQ39391 | V71E | C-terminal |
| QDF43841, YP_009273007 | V71Q | C-terminal |
| AIA62348, ASL68947, ASL68958, AHY61342, AUM60029, ATQ39391 | P72S | C-terminal |
| AIA62357 | P72N | C-terminal |
| QDF43841, YP_009273007 | P72E | C-terminal |
| AIA62348, AIA62357 | L73D | C-terminal |
| ASL68947, ASL68958, AHY61342, AUM60029, ATQ39391 | L73E | C-terminal |
| QDF43841 | L73G | C-terminal |
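Each mutation label in the table follows the standard notation of wild-type residue, 1-based position, and mutant residue (for example, E7Q). A minimal sketch of parsing such labels and of listing substitutions between two equal-length sequences is given below. The function names are illustrative, and the example uses the 75-residue SARS-CoV-2 reference E protein rather than a bat sequence, because the reference used for the bat comparison is not stated here.

```python
import re

# 75-residue E protein of the SARS-CoV-2 reference, used only as an example input.
REFERENCE_E = (
    "MYSFVSEETGTLIVNSVLLFLAFVVFLLVTLAILTALRLCAYCCNIVNVSLVKPSFYVYSRVKNLNSSRVPDLLV"
)


def parse_mutation(label):
    """Split a label such as 'E7Q' into (wild_type, position, mutant)."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", label)
    if match is None:
        raise ValueError(f"not a substitution label: {label!r}")
    return match.group(1), int(match.group(2)), match.group(3)


def apply_mutation(sequence, label):
    """Return a copy of the sequence carrying the given substitution."""
    wild_type, position, mutant = parse_mutation(label)
    if sequence[position - 1] != wild_type:
        raise ValueError(f"{label}: residue {position} is {sequence[position - 1]}")
    return sequence[: position - 1] + mutant + sequence[position:]


def call_mutations(reference, variant):
    """List substitutions between two equal-length (ungapped) sequences."""
    if len(reference) != len(variant):
        raise ValueError("sequences must have the same length")
    return [
        f"{ref}{pos}{var}"
        for pos, (ref, var) in enumerate(zip(reference, variant), start=1)
        if ref != var
    ]


if __name__ == "__main__":
    variant = apply_mutation(REFERENCE_E, "S68F")
    print(call_mutations(REFERENCE_E, variant))
```

Sequences of different lengths would first need a proper alignment; this sketch deliberately handles only the ungapped case.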
Fig. 3. Sequence alignment of the E protein of Camel CoV.
Fig. 4. Sequence alignment of the E protein of Cat CoV.
Missense mutations in the envelope protein of the Cat CoV.
| Protein ID | Mutation | Domain |
|---|---|---|
| AXE71624 | K51N, L81S | C-terminal |
| AEK25514 | W22L | TMD |
| ADO39821 | N48D | C-terminal |
| ACT10869 | V19G, R59C | TMD, C-terminal |
| ACT10909 | L81M | C-terminal |
| ACT10941 | V19G | TMD |
Fig. 5. Sequence alignment of the E protein of Cattle CoV.
Protein IDs and respective locations of mutations in the E proteins of SARS-CoV-2.
| Protein ID (location) | Mutation | Domain | Change in amino acid property |
|---|---|---|---|
| QKO24093 (USA: San Diego, California) | E8K | N-terminal | Acidic to Basic |
| QKU52835 (USA: WA) | E7Q | N-terminal | Acidic to Neutral (polar) |
| QKN20885 (USA), QJQ84210 (USA: New Orleans, LA) | F26L | TMD | Hydrophobic to Hydrophobic |
| QKI36831 (China: Guangzhou) | D72Y | C-terminal | Hydrophilic to Hydrophobic |
| QKI36855 (China: Guangzhou) | S68C | C-terminal | Hydrophilic to Hydrophobic |
| QKG87268, QKG88576 (USA: Massachusetts) | S68F | C-terminal | Hydrophilic to Hydrophobic |
| QKE45838 (USA:CA), QKE45886 (USA:CA) | P71L | C-terminal | Hydrophobic to Hydrophobic |
| QKE45898 (USA:CA), QKE45910 (USA:CA) | P71L | C-terminal | Hydrophobic to Hydrophobic |
| QJE38284 (USA:CA), QIU81527 (USA:WA), QKV06741 (USA: WA) | P71L | C-terminal | Hydrophobic to Hydrophobic |
| QKU32371 (USA: CA) | P71L | C-terminal | Hydrophobic to Hydrophobic |
| QJS53352 (Greece: Athens) | L39M | TMD | Hydrophobic to Hydrophobic |
| QJR88103 (Australia: Victoria) | L73F | C-terminal | Hydrophobic to Hydrophobic |
| QJA42107 (USA: VA) | A36V | TMD | Hydrophobic to Hydrophobic |
| QHZ00381 (South Korea) | L37H | TMD | Hydrophobic to Hydrophilic |
| QKU31207 (USA: CA) | T9I | N-terminal | Hydrophilic to Hydrophobic |
| QKU37035 (Saudi Arabia: Jeddah) | L19F | TMD | Hydrophobic to Hydrophobic |
| QKV07065 (USA: WA) | S55F | C-terminal | Hydrophilic to Hydrophobic |
| QKU28584 (USA: WA) | A41S | C-terminal | Hydrophobic to Hydrophilic |
Fig. 6. Sequence alignment of the E protein of SARS-CoV-2.
Envelope proteins across different host CoVs.
| E protein sequence | Length (aa) | |
|---|---|---|
| MYSFVSEETGTLIVNSVLLFLAFVVFLLVTLAILTALRLCAYCCNIVNVSLVKPSFYVYSRVKNLNSSRVPDLLV | 75 | |
| MFMADAYLADTVWYVGQIIFIVAICLLVTIVVVAFLATFKLCIQLCGMCNTLVLSPSIYVFNRGRQFYEFYNDIKPPVLDVDDV | 84 | |
| MYSFVSEETGTLIVNSVLLFLAFVVFLLVTLAILTALRLCAYCCNIVNVSLVKPSFYVYSRVKNLNSSRVPDLLV | 75 | |
| MMFPRAFTIIDDHGMVVSVFFWLLLIIILILFSIALLNVIKLCMVCCNLGKTIIVLPARHAYDAYKNFMHIKAYDPDEAFLV | 82 | |
| MLPFVQERIGLFIVNFFIFTVVCAITLLVCMAFLTATRLCVQCITGFNTLLVQPALYLYNTGRSVYVKFQDSKPPLPPDEWV | 82 | |
| MFMADAYFADTVWYVGQIIFIVAICLLVIIVVVAFLATFKLCIQLCGMCNTLVLSPSIYVFNRGRQFYEFYNDVKPPVLDVDDV | 84 | |
| MYSFVSEETGTLIVNSVLLFLAFVVFLLVTLAILTALRLCAYCCNIVNVSLVKPTVYVYSRVKNLNSSEGVPDLLV | 76 |
Fig. 7. Sequence-homology-based phylogeny of the envelope protein of different host CoVs.
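Amino acid counts of the envelope proteins across the different host CoVs.
| A | R | N | D | C | Q | E | G | H | I | L | K | M | F | P | S | T | W | Y | V | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|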
| 4 | 3 | 5 | 1 | 3 | 0 | 2 | 1 | 0 | 3 | 14 | 2 | 1 | 5 | 2 | 8 | 4 | 0 | 4 | 13 | |
| 6 | 2 | 3 | 6 | 4 | 3 | 1 | 3 | 0 | 8 | 9 | 2 | 3 | 7 | 3 | 2 | 4 | 1 | 5 | 12 | |
| 4 | 3 | 5 | 1 | 3 | 0 | 2 | 1 | 0 | 3 | 14 | 2 | 1 | 5 | 2 | 8 | 4 | 0 | 4 | 13 | |
| 7 | 2 | 3 | 5 | 3 | 0 | 1 | 2 | 3 | 11 | 11 | 4 | 5 | 7 | 3 | 2 | 2 | 1 | 3 | 7 | |
| 4 | 3 | 3 | 2 | 4 | 4 | 2 | 3 | 0 | 5 | 11 | 2 | 2 | 8 | 6 | 2 | 7 | 1 | 3 | 10 | |
| 6 | 2 | 3 | 6 | 4 | 3 | 1 | 3 | 0 | 8 | 8 | 2 | 3 | 8 | 3 | 2 | 3 | 1 | 5 | 13 | |
| 4 | 2 | 5 | 1 | 3 | 0 | 3 | 2 | 0 | 3 | 14 | 2 | 1 | 4 | 2 | 7 | 5 | 0 | 4 | 14 |
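The counts above can be reproduced by tallying each residue in a sequence. In the sketch below, the column order A, R, N, D, C, Q, E, G, H, I, L, K, M, F, P, S, T, W, Y, V is an inference: it is the order that reproduces the first row of counts from the first 75-residue sequence listed earlier. The names are illustrative.

```python
# Assumed column order (alphabetical by three-letter code: Ala, Arg, Asn, ...).
AMINO_ACID_ORDER = "ARNDCQEGHILKMFPSTWYV"


def amino_acid_counts(sequence):
    """Return the count of each of the 20 amino acids in the assumed order."""
    return [sequence.count(residue) for residue in AMINO_ACID_ORDER]


if __name__ == "__main__":
    first_sequence = (
        "MYSFVSEETGTLIVNSVLLFLAFVVFLLVTLAILTALRLCAYCCNIVNVSLVKPSFYVYSRVKNLNSSRVPDLLV"
    )
    print(amino_acid_counts(first_sequence))
```

The counts of a sequence always sum to its length, which gives a quick consistency check on any row.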
Fig. 8. Phylogenetic relationship among the different host CoVs with respect to amino acid conservation of the envelope protein.
Fig. 9. Phylogenetic relationship among envelope proteins of the different host CoVs with respect to sequence-based homology.
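Frequency of amino acids in the envelope proteins across the seven different host CoVs.
| Protein ID | A | R | N | D | C | Q | E | G | H | I | L | K | M | F | P | S | T | W | Y | V | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|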
| ASL68947 | 7 | 2 | 3 | 1 | 4 | 5 | 3 | 3 | 0 | 6 | 8 | 2 | 3 | 7 | 6 | 3 | 6 | 1 | 3 | 9 | |
| ASU90554 | 4 | 3 | 3 | 2 | 4 | 3 | 2 | 3 | 1 | 4 | 11 | 2 | 3 | 8 | 6 | 2 | 7 | 1 | 3 | 10 | |
| ASL68958 | 7 | 2 | 2 | 1 | 4 | 5 | 3 | 3 | 0 | 7 | 8 | 2 | 3 | 7 | 6 | 3 | 6 | 1 | 3 | 9 | |
| ANI69894 | 4 | 3 | 3 | 1 | 4 | 4 | 2 | 3 | 1 | 4 | 11 | 2 | 3 | 8 | 6 | 2 | 7 | 1 | 3 | 10 | |
| AXE71624 | 7 | 2 | 4 | 4 | 3 | 1 | 1 | 2 | 2 | 11 | 10 | 3 | 5 | 7 | 3 | 3 | 3 | 1 | 3 | 7 | |
| ALA49346 | 4 | 3 | 3 | 2 | 4 | 4 | 2 | 3 | 0 | 4 | 11 | 2 | 3 | 7 | 6 | 3 | 7 | 1 | 3 | 10 | |
| AGT52084 | 7 | 2 | 3 | 4 | 3 | 1 | 1 | 2 | 2 | 11 | 11 | 4 | 5 | 6 | 3 | 3 | 3 | 1 | 3 | 7 | |
| AUM60029 | 6 | 2 | 3 | 1 | 4 | 6 | 2 | 3 | 0 | 6 | 8 | 2 | 3 | 7 | 6 | 3 | 5 | 1 | 3 | 11 | |
| ACT10909 | 7 | 3 | 3 | 4 | 3 | 1 | 1 | 2 | 2 | 10 | 11 | 3 | 6 | 6 | 3 | 2 | 3 | 1 | 3 | 8 | |
| AHY61342 | 7 | 2 | 3 | 1 | 4 | 5 | 3 | 3 | 0 | 7 | 9 | 2 | 2 | 7 | 6 | 3 | 4 | 1 | 3 | 10 | |
| AIA62348 | 7 | 2 | 2 | 1 | 5 | 4 | 3 | 3 | 3 | 6 | 8 | 1 | 1 | 6 | 6 | 3 | 4 | 1 | 3 | 13 | |
| ACT10941 | 7 | 2 | 3 | 4 | 3 | 1 | 1 | 3 | 2 | 11 | 12 | 4 | 5 | 6 | 3 | 2 | 3 | 1 | 3 | 6 | |
| AYF53097 | 7 | 2 | 3 | 4 | 3 | 1 | 1 | 2 | 2 | 10 | 11 | 4 | 5 | 7 | 3 | 2 | 3 | 1 | 3 | 8 | |
| QCI31474 | 4 | 3 | 3 | 2 | 4 | 4 | 2 | 3 | 0 | 4 | 11 | 2 | 3 | 8 | 6 | 2 | 7 | 1 | 3 | 10 | |
| ASU62492 | 7 | 2 | 3 | 4 | 3 | 1 | 1 | 2 | 2 | 11 | 11 | 4 | 5 | 7 | 3 | 2 | 3 | 1 | 3 | 7 | |
| ACT10974 | 6 | 2 | 3 | 4 | 3 | 1 | 1 | 2 | 2 | 11 | 11 | 4 | 5 | 7 | 3 | 2 | 3 | 1 | 3 | 8 | |
| ALA49390 | 4 | 3 | 3 | 2 | 4 | 4 | 2 | 3 | 0 | 4 | 12 | 2 | 3 | 7 | 6 | 2 | 7 | 1 | 3 | 10 | |
| ASU89926 | 4 | 3 | 3 | 2 | 4 | 4 | 2 | 3 | 0 | 5 | 11 | 2 | 2 | 8 | 6 | 2 | 7 | 1 | 3 | 10 | |
| ACT10869 | 7 | 1 | 3 | 4 | 4 | 1 | 1 | 3 | 2 | 11 | 12 | 4 | 5 | 6 | 3 | 2 | 3 | 1 | 3 | 6 | |
| AIA62357 | 6 | 3 | 5 | 1 | 4 | 3 | 3 | 3 | 1 | 8 | 8 | 1 | 1 | 8 | 6 | 1 | 6 | 1 | 3 | 10 | |
| ASU90334 | 4 | 3 | 3 | 2 | 4 | 4 | 2 | 3 | 0 | 4 | 11 | 2 | 2 | 8 | 6 | 2 | 7 | 1 | 3 | 10 | |
| ASU62503 | 7 | 2 | 3 | 4 | 3 | 1 | 1 | 2 | 2 | 11 | 12 | 4 | 5 | 6 | 3 | 2 | 3 | 1 | 3 | 7 | |
| ADO39821 | 7 | 2 | 2 | 5 | 3 | 1 | 1 | 2 | 2 | 11 | 11 | 4 | 5 | 7 | 3 | 2 | 3 | 1 | 3 | 7 | |
| QDM36990 | 7 | 2 | 4 | 4 | 3 | 1 | 1 | 2 | 2 | 11 | 11 | 4 | 4 | 7 | 3 | 2 | 2 | 1 | 3 | 8 | |
| ACT10858 | 7 | 2 | 4 | 4 | 3 | 1 | 1 | 2 | 2 | 11 | 12 | 4 | 5 | 6 | 3 | 2 | 2 | 1 | 3 | 7 | |
| AWW13513 | 6 | 2 | 3 | 6 | 4 | 3 | 1 | 3 | 0 | 8 | 9 | 2 | 3 | 7 | 3 | 2 | 4 | 1 | 5 | 12 | |
| ACT10920 | 7 | 2 | 4 | 4 | 3 | 1 | 1 | 2 | 2 | 11 | 11 | 3 | 5 | 7 | 2 | 2 | 2 | 1 | 3 | 7 | |
| ATQ39391 | 4 | 2 | 3 | 0 | 4 | 5 | 4 | 3 | 0 | 5 | 8 | 2 | 3 | 7 | 6 | 3 | 7 | 1 | 3 | 12 | |
| QBM11741 | 4 | 3 | 3 | 2 | 4 | 4 | 2 | 3 | 0 | 4 | 12 | 2 | 3 | 8 | 6 | 1 | 7 | 1 | 3 | 10 | |
| AVI15004 | 6 | 2 | 3 | 6 | 4 | 3 | 1 | 3 | 0 | 8 | 8 | 2 | 3 | 8 | 3 | 2 | 3 | 1 | 5 | 13 | |
| AUG98123 | 7 | 3 | 3 | 4 | 3 | 0 | 1 | 2 | 2 | 12 | 11 | 4 | 5 | 7 | 3 | 2 | 3 | 1 | 3 | 6 | |
| AMD11134 | 7 | 2 | 3 | 5 | 3 | 0 | 1 | 2 | 3 | 11 | 11 | 4 | 5 | 7 | 3 | 2 | 2 | 1 | 3 | 7 | |
| AEK25525 | 7 | 2 | 3 | 4 | 3 | 0 | 1 | 2 | 2 | 11 | 11 | 5 | 5 | 7 | 3 | 2 | 3 | 1 | 3 | 7 | |
| AVZ61113 | 6 | 2 | 3 | 6 | 4 | 3 | 1 | 3 | 0 | 8 | 8 | 2 | 2 | 7 | 3 | 2 | 3 | 1 | 5 | 13 | |
| ALA50082 | 6 | 2 | 3 | 6 | 4 | 3 | 1 | 3 | 0 | 9 | 8 | 2 | 2 | 8 | 3 | 2 | 3 | 1 | 5 | 13 | |
| AEK25514 | 7 | 2 | 3 | 4 | 3 | 1 | 1 | 2 | 2 | 11 | 12 | 4 | 5 | 7 | 3 | 2 | 3 | 0 | 3 | 7 | |
| YP_009072442 | 6 | 3 | 3 | 1 | 4 | 5 | 3 | 3 | 0 | 5 | 12 | 1 | 1 | 4 | 2 | 3 | 5 | 0 | 5 | 13 | |
| QDF43841 | 3 | 1 | 5 | 2 | 5 | 2 | 5 | 4 | 1 | 10 | 15 | 2 | 1 | 5 | 1 | 4 | 3 | 0 | 2 | 10 | |
| YP_009273007 | 2 | 0 | 3 | 2 | 6 | 2 | 4 | 2 | 0 | 7 | 12 | 3 | 1 | 4 | 1 | 5 | 6 | 0 | 4 | 12 | |
| QHZ00381 | 4 | 3 | 5 | 1 | 3 | 0 | 2 | 1 | 1 | 3 | 13 | 2 | 1 | 5 | 2 | 8 | 4 | 0 | 4 | 13 | |
| ATO98135 | 4 | 2 | 5 | 1 | 3 | 1 | 2 | 2 | 0 | 3 | 14 | 2 | 1 | 4 | 2 | 7 | 5 | 0 | 4 | 14 | |
| AIA62302 | 4 | 2 | 5 | 2 | 4 | 0 | 2 | 1 | 0 | 3 | 12 | 2 | 2 | 4 | 2 | 7 | 5 | 0 | 4 | 15 | |
| QJS53352 | 4 | 3 | 5 | 1 | 3 | 0 | 2 | 1 | 0 | 3 | 13 | 2 | 2 | 5 | 2 | 8 | 4 | 0 | 4 | 13 | |
| QDF43816 | 4 | 2 | 5 | 1 | 3 | 0 | 3 | 2 | 0 | 4 | 14 | 2 | 1 | 4 | 2 | 7 | 5 | 0 | 4 | 13 | |
| ABD75324 | 4 | 2 | 5 | 1 | 3 | 0 | 3 | 2 | 0 | 3 | 13 | 2 | 1 | 5 | 2 | 7 | 5 | 0 | 4 | 14 | |
| ADK66843 | 4 | 2 | 4 | 0 | 3 | 1 | 4 | 1 | 0 | 3 | 13 | 2 | 1 | 6 | 2 | 8 | 5 | 0 | 4 | 13 | |
| ATO98160 | 5 | 2 | 5 | 1 | 3 | 0 | 3 | 2 | 0 | 3 | 14 | 2 | 1 | 4 | 2 | 6 | 5 | 0 | 4 | 14 | |
| AIA62280 | 5 | 2 | 5 | 1 | 3 | 0 | 3 | 2 | 0 | 3 | 13 | 2 | 1 | 4 | 2 | 6 | 5 | 0 | 4 | 15 | |
| QJR88103 | 4 | 3 | 5 | 1 | 3 | 0 | 2 | 1 | 0 | 3 | 13 | 2 | 1 | 6 | 2 | 8 | 4 | 0 | 4 | 13 | |
| AKZ19089 | 5 | 2 | 5 | 1 | 3 | 0 | 3 | 2 | 0 | 3 | 14 | 2 | 1 | 4 | 2 | 7 | 4 | 0 | 4 | 14 | |
| QDF43821 | 4 | 2 | 5 | 1 | 3 | 0 | 3 | 2 | 0 | 3 | 14 | 2 | 1 | 4 | 2 | 7 | 5 | 0 | 4 | 14 | |
| QKI36855 | 4 | 3 | 5 | 1 | 4 | 0 | 2 | 1 | 0 | 3 | 14 | 2 | 1 | 5 | 2 | 7 | 4 | 0 | 4 | 13 | |
| AIA62312 | 4 | 2 | 5 | 1 | 3 | 0 | 3 | 2 | 0 | 3 | 13 | 2 | 1 | 4 | 2 | 7 | 5 | 0 | 4 | 15 | |
| ABD75313 | 4 | 2 | 5 | 2 | 4 | 0 | 2 | 1 | 0 | 3 | 13 | 2 | 1 | 4 | 2 | 7 | 5 | 0 | 4 | 15 | |
| QKG87268 | 4 | 3 | 5 | 1 | 3 | 0 | 2 | 1 | 0 | 3 | 14 | 2 | 1 | 6 | 2 | 7 | 4 | 0 | 4 | 13 | |
| AHX37560 | 4 | 2 | 4 | 1 | 3 | 0 | 3 | 2 | 0 | 3 | 14 | 2 | 1 | 4 | 2 | 8 | 5 | 0 | 4 | 14 | |
| AGC74167 | 4 | 2 | 5 | 1 | 3 | 0 | 2 | 2 | 0 | 3 | 13 | 2 | 1 | 5 | 2 | 7 | 5 | 0 | 4 | 15 | |
| AVP78044 | 4 | 3 | 5 | 1 | 3 | 0 | 2 | 1 | 0 | 3 | 14 | 2 | 1 | 5 | 2 | 8 | 4 | 0 | 4 | 13 | |
| QIG55947 | 4 | 3 | 5 | 1 | 3 | 0 | 2 | 1 | 0 | 3 | 14 | 2 | 1 | 5 | 2 | 8 | 4 | 0 | 4 | 13 | |
| YP_009724392 | 4 | 3 | 5 | 1 | 3 | 0 | 2 | 1 | 0 | 3 | 14 | 2 | 1 | 5 | 2 | 8 | 4 | 0 | 4 | 13 | |
| QJR89447 | 4 | 3 | 5 | 1 | 3 | 0 | 2 | 1 | 0 | 3 | 14 | 2 | 1 | 5 | 2 | 8 | 4 | 0 | 4 | 12 | |
| QJQ84210 | 4 | 3 | 5 | 1 | 3 | 0 | 2 | 1 | 0 | 3 | 15 | 2 | 1 | 4 | 2 | 8 | 4 | 0 | 4 | 13 | |
| QJA42107 | 3 | 3 | 5 | 1 | 3 | 0 | 2 | 1 | 0 | 3 | 14 | 2 | 1 | 5 | 2 | 8 | 4 | 0 | 4 | 14 | |
| ATO98184 | 4 | 2 | 5 | 1 | 3 | 0 | 3 | 2 | 0 | 3 | 15 | 2 | 1 | 4 | 1 | 7 | 5 | 0 | 4 | 14 | |
| QKE45838 | 4 | 3 | 5 | 1 | 3 | 0 | 2 | 1 | 0 | 3 | 15 | 2 | 1 | 5 | 1 | 8 | 4 | 0 | 4 | 13 | |
| QKI36831 | 4 | 3 | 5 | 0 | 3 | 0 | 2 | 1 | 0 | 3 | 14 | 2 | 1 | 5 | 2 | 8 | 4 | 0 | 5 | 13 | |
| QJI54124 | 4 | 3 | 5 | 0 | 3 | 0 | 2 | 1 | 0 | 3 | 13 | 2 | 1 | 5 | 2 | 8 | 4 | 0 | 4 | 13 | |
| QKU31207 | 4 | 3 | 5 | 1 | 3 | 0 | 2 | 1 | 0 | 4 | 14 | 2 | 1 | 5 | 2 | 8 | 3 | 0 | 4 | 13 | |
| QKU37035 | 4 | 3 | 5 | 1 | 3 | 0 | 2 | 1 | 0 | 3 | 13 | 2 | 1 | 6 | 2 | 8 | 4 | 0 | 4 | 13 | |
| QKV07065 | 4 | 3 | 5 | 1 | 3 | 0 | 2 | 1 | 0 | 3 | 14 | 2 | 1 | 6 | 2 | 7 | 4 | 0 | 4 | 13 | |
| QKU28584 | 3 | 3 | 5 | 1 | 3 | 0 | 2 | 1 | 0 | 3 | 14 | 2 | 1 | 5 | 2 | 9 | 4 | 0 | 4 | 13 | |
| QKU52835 | 4 | 3 | 5 | 1 | 3 | 1 | 1 | 1 | 0 | 3 | 14 | 2 | 1 | 5 | 2 | 8 | 4 | 0 | 4 | 13 |
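One simple way to compare proteins by amino acid composition is to treat each row of the table as a 20-dimensional vector and compute pairwise distances. The sketch below uses Euclidean distance purely as an illustration; the metric and clustering procedure actually used for the conservation-based phylogeny are not stated here, so this choice is an assumption. The four rows are copied from the table above.

```python
import math

# Count vectors copied from the frequency table (20 values per protein).
COUNTS = {
    "YP_009724392": [4, 3, 5, 1, 3, 0, 2, 1, 0, 3, 14, 2, 1, 5, 2, 8, 4, 0, 4, 13],
    "QIG55947": [4, 3, 5, 1, 3, 0, 2, 1, 0, 3, 14, 2, 1, 5, 2, 8, 4, 0, 4, 13],
    "QDF43821": [4, 2, 5, 1, 3, 0, 3, 2, 0, 3, 14, 2, 1, 4, 2, 7, 5, 0, 4, 14],
    "AMD11134": [7, 2, 3, 5, 3, 0, 1, 2, 3, 11, 11, 4, 5, 7, 3, 2, 2, 1, 3, 7],
}


def euclidean_distance(a, b):
    """Euclidean distance between two equal-length count vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


if __name__ == "__main__":
    reference = COUNTS["YP_009724392"]
    for protein_id, vector in COUNTS.items():
        print(protein_id, round(euclidean_distance(reference, vector), 3))
```

Under this illustrative metric, the pangolin entry (QIG55947) is at distance zero from the SARS-CoV-2 reference, the bat entry QDF43821 is close, and the feline entry AMD11134 is much farther away.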
Fig. 10. Phylogenetic relationship among envelope proteins of the different host CoVs with respect to amino acid conservation.
Shannon entropy of the amino acid conservation of the E protein of the host CoVs.
| Protein ID | Host | Shannon entropy | Protein ID | Host | Shannon entropy | Protein ID | Host | Shannon entropy |
|---|---|---|---|---|---|---|---|---|
| ASL68947 | Bat CoV | 0.933 | ACT10858 | Feline CoV | 0.919 | AIA62280 | | 0.851 |
| ASU90554 | Camel CoV | 0.932 | AWW13513 | Chimpanzee CoV | 0.918 | QJR88103 | | 0.850 |
| ASL68958 | Bat CoV | 0.930 | ACT10920 | Feline CoV | 0.916 | QKU37035 | | 0.850 |
| ANI69894 | Camel CoV | 0.929 | ATQ39391 | Bat CoV | 0.916 | AKZ19089 | | 0.850 |
| AXE71624 | Feline CoV | 0.928 | QBM11741 | Camel CoV | 0.915 | QDF43821 | | 0.850 |
| ALA49346 | Camel CoV | 0.927 | AVI15004 | Bovine CoV | 0.914 | QKI36855 | | 0.850 |
| AGT52084 | Feline CoV | 0.926 | AUG98123 | Feline CoV | 0.912 | AIA62312 | | 0.849 |
| AUM60029 | Bat CoV | 0.926 | AMD11134 | Feline CoV | 0.912 | ABD75313 | | 0.848 |
| ACT10909 | Feline CoV | 0.926 | AEK25525 | Feline CoV | 0.912 | QKG87268 | | 0.848 |
| AHY61342 | Bat CoV | 0.925 | AVZ61113 | Bovine CoV | 0.912 | QKV07065 | | 0.848 |
| AIA62348 | Bat CoV | 0.925 | ALA50082 | Camel CoV | 0.909 | AHX37560 | | 0.847 |
| ACT10941 | Feline CoV | 0.924 | AEK25514 | Feline CoV | 0.908 | AGC74167 | Bat CoV | 0.847 |
| AYF53097 | Feline CoV | 0.924 | YP_009072442 | Bat CoV | 0.888 | AVP78044 | Bat CoV | 0.846 |
| QCI31474 | Camel CoV | 0.923 | QDF43841 | Bat CoV | 0.881 | QIG55947 | | 0.846 |
| ASU62492 | Feline CoV | 0.922 | YP_009273007 | | 0.868 | YP_009724392 | | 0.846 |
| ACT10974 | Feline CoV | 0.922 | QHZ00381 | | 0.862 | QKU31207 | | 0.846 |
| ALA49390 | Camel CoV | 0.921 | ATO98135 | | 0.858 | QJR89447 | | 0.843 |
| ASU89926 | Camel CoV | 0.921 | AIA62302 | | 0.857 | QKU28584 | | 0.842 |
| ACT10869 | Feline CoV | 0.920 | QJS53352 | | 0.856 | QJQ84210 | | 0.841 |
| AIA62357 | Bat CoV | 0.920 | QDF43816 | | 0.856 | QJA42107 | | 0.840 |
| ASU90334 | Camel CoV | 0.920 | ABD75324 | Bat CoV | 0.855 | ATO98184 | | 0.840 |
| ASU62503 | Feline CoV | 0.920 | ADK66843 | | 0.852 | QKE45838 | | 0.836 |
| ADO39821 | Feline CoV | 0.920 | QKU52835 | | 0.852 | QKI36831 | | 0.835 |
| QDM36990 | Feline CoV | 0.919 | ATO98160 | | 0.851 | QJI54124 | | 0.824 |
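The entropy values above lie between 0 and 1, which is consistent with a Shannon entropy of the amino acid frequencies taken with a logarithm of base 20, $SE = -\sum_i p_i \log_{20} p_i$, where $p_i$ is the fraction of residues of type $i$. The base is an assumption, not something stated here; it is chosen because it reproduces the reported values for YP_009724392 (0.846) and QKU28584 (0.842) from their amino acid counts. A minimal sketch with illustrative names:

```python
import math


def shannon_entropy(counts, base=20):
    """Shannon entropy of a count vector, using an assumed log base of 20."""
    total = sum(counts)
    if total <= 0:
        raise ValueError("counts must sum to a positive number")
    entropy = 0.0
    for count in counts:
        if count > 0:  # zero counts contribute nothing to the sum
            p = count / total
            entropy -= p * math.log(p, base)
    return entropy


if __name__ == "__main__":
    # Amino acid counts of YP_009724392, copied from the frequency table.
    reference_counts = [4, 3, 5, 1, 3, 0, 2, 1, 0, 3, 14, 2, 1, 5, 2, 8, 4, 0, 4, 13]
    print(round(shannon_entropy(reference_counts), 3))
```

With base 20, a protein using all 20 amino acids equally often would score exactly 1, and a protein built from a single amino acid would score 0.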
Fig. 11. Phylogenetic relationship among envelope proteins of the SARS-CoV2, Bat and Pangolin CoVs with respect to amino acid conservation.