| Literature DB >> 33809265 |
John T Hancock1, Ros C Rouse2, Emma Stone1, Alexander Greenhough1.
Abstract
COVID-19, caused by SARS-CoV-2, is a world-wide problem for the human population. It is known that some animal species, such as mink, can become infected and transmit the virus. However, the susceptibility of most animals is not known. Here, we review the use of sequence analysis of the proteins which are known to interact with SARS-CoV-2 as a way to estimate an animal's susceptibility. Although most such work concentrates on the angiotensin-converting enzyme 2 receptor (ACE2), here TMPRSS2 (Transmembrane Serine Protease 2), neuropilin-1 and furin are also considered. Polymorphisms, especially ones which are known to alter viral/host interactions are also discussed. Analysis of ACE2 and TMPRSS2 protein sequences across species suggests this approach may be of some utility in predicting susceptibility; however, this analysis fails to highlight some susceptible animals such as mink. However, combined with observational data which emerges over time about which animals actually become infected, this may, in the future, be a useful tool to assist the management of risks associated with human/animal contact and support conservation and animal welfare measures.Entities:
Keywords: ACE2; COVID-19; SARS-CoV-2; TMPRSS2; furin; neuropilin-1; polymorphisms
Year: 2021 PMID: 33809265 PMCID: PMC8000148 DOI: 10.3390/ani11030797
Source DB: PubMed Journal: Animals (Basel) ISSN: 2076-2615 Impact factor: 2.752
Animal species and Accession Numbers of the sequences used for the amino acid alignments shown in the Figures. Data were obtained from The National Center for Biotechnology Information (NCBI). X#, I# or V# indicates the isoforms chosen. N/A: not available from NCBI; * indicates that for the furin sequence this was from a Bos indicus × Bos taurus (hybrid cattle).
| Common Name | Animal | ACE2 | TMPRSS2 | Neuropilin-1 | Furin |
|---|---|---|---|---|---|
| Human |
| BAB40370.1 | V1: NM_001135099.1 | X1: NM_003873.7 | I1: NP_001276753.1 |
| V2: NM_005656.4 | X6: XM_017016866.2 | I2: NP_001369551.1 | |||
| Gorilla |
| XP_018874749.1 | X1: XM_019017901.2 | X1: XM_004049248.3 | XP_018866183.1 |
| X2: XM_004062839.3 | |||||
| Macaque |
| XP_024647450.1 | XM_011725990.2 | X1: XM_011738246.1 | XP_011750505.1 |
| Lemur (Grey mouse) |
| XP_020140826.1 | XM_012749836.2 | X1: XP_020139780.1 | XP_012618739.1 |
| Camel |
| XP_006194263.1 | XM_032461838.1 | X1: XP_032329703.1 | X1: XP_032325184.1 |
| Horse |
| XP_001490241.1 | X1: XM_005606160.3 | X1: XM_023632110.1 | XP_023505390.1 |
| X2: XM_014736385.2 | X2: XM_001915381.4 | ||||
| African elephant |
| XP_023410960.1 | XM_023559111.1 | XM_023539786.1 | XM_003413876.3 |
| Goat |
| AHI85757.1 | X1: XM_013966670.2 | X1: XP_017912559.1 | XP_017921808.1 |
| Cow | X1: XP_024843618.1 | X1: XM_015473655.2 | X1: XP_024856535.1 | XP_027377940.1 | |
| Sheep |
| XP_011961657.1 | XM_027960704.1 | X1: XP_027832377.1 | CAJ29337.1 |
| Pig |
| X1: XP_020935033.1 | NM_001386131.1 | XP_020920629.1 | X1: XP_020954578.1 |
| Black rat |
| XP_032746145.1 | XM_032900351.1 | XP_032743292.1 | XP_032749231.1 |
| Mouse |
| EDL40761.1 | NM_015775.2 | AAH60129.1 | CAA37988.1 |
| Rabbit |
| QHX39726.1 | NM_001386128.1 | XP_008272962.1 | XP_002721548.2 |
| Cat (domestic) |
| AAX59005.1 | X1: XM_023238709.1 | X1: XP_006933414.1 | XP_023110662.1 |
| X2: XM_023238710.1 | X2: XP_003988264.1 | ||||
| X3: XM_023238711.1 | X3: XP_003988265.1 | ||||
| X4: XP_006933415.1 | |||||
| Bear (grizzly) |
| XP_026333866.1 | X1: XM_026499970.1 | X1: XM_026483469.1 | XM_026510505.1 |
| Tiger (Siberian) |
| XP_007090142.1 | XM_015541202.1 | XP_007092922.2 | XP_015391216.1 |
| Dog (domestic) |
| X1: XP_013966804.1 | XM_022413273.1 | X1: XP_005617003.1 | XP_022272656.1 |
| X2: XP_535142.3 | |||||
| American mink |
| CCP86723.1 | N/A | CCP78142.1 | CCP82836.1 |
| Ferret |
| BAE53380.1 | XM_013061267.1 | X1: XP_004774343.2 | X1: XP_004763758.1 |
| Ground squirrel |
| XP_005316051.3 | XM_021725256.1 | X1: XM_005330824.1 | X1: XP_021578591.1 |
| Kuhl’s pipistrelle bat |
| XP_036295422.1 | XM_036446834.1 | X1: XP_036309963.1 | XP_036271438.1 |
| Little brown bat |
| X1: XP_023609438.1 | XM_006104440.3 | XP_023607944.1 | N/A |
| Beluga whale |
| XP_022418360.1 | X1: XM_022552959.1 | X1: XP_030617644.1 | X1: XP_022419719.1 |
| Grey seal |
| XP_035963182.1 | X1: XM_036108025.1 | X1: XP_035974500.1 | X1: XP_035951550.1 |
| Bottlenosed dolphin |
| XP_019781177.2 | XM_033856384.1 | X1: XP_033708297.1 | XP_019801399.1 |
| Cod (fish) |
| XP_030232530.1 | N/A | X1: XP_030204747.1 | N/A |
| Arkansas darter (fish) |
| XP_034732342.1 | XM_034889577.1 | N/A | a: XP_034734947.1 |
| b: XP_034734346.1 | |||||
| Rainbow trout (fish) |
| N/A | X1: XM_021618133.2 | N/A | X1: XP_021428901.2 |
| Chicken (domestic) |
| QEQ50331.1 | X1: XM_416737.6 | X1: XP_015136776.1 | NP_990046.1 |
| Duck |
| XP_012949915.2 | X1: XM_027448371.1 | X1: XP_027306331.1 | X1: XP_027321808.1 |
| Rock dove |
| XP_021154486.1 | XM_021282586.1 | PKK32102.1 | XP_021155000.1 |
| PKK30539.1 | |||||
| Tiger snake |
| XP_026530754.1 | X1: XM_026673992.1 | X1: XP_026528008.1 | X1: XP_026540426.1 |
| African clawed frog |
| XP_018104311.1 | V1: NM_001087958.1 | NP_001081380.1 | AAW83022.1 |
| Sloth |
| N/A | N/A | N/A | N/A |
| African lion |
| N/A | N/A | N/A | N/A |
| Indian elephant |
| N/A | N/A | N/A | N/A |
Amino acids deemed to be important in selected proteins and polymorphisms which have been found to be significant. Glycosylated residues highlighted in red. Note: Senapati et al. [49] is a preprint at the time of writing this manuscript.
| Protein | Reference(s)/Comments | List of Amino Acids Thought to Be Important for Viral/Host Interactions | Polymorphisms of Interest | Comments and Refs for PMs |
|---|---|---|---|---|
| ACE2 | [ | S19, Q24, T27, F28, D30, K31, H34, E35, E37, D38, Y41, Q42, L45, L79, M82, Y83, N330, K353, G354, D355, R357, R393 | S19P, T27A, K31R, N33I, H34R, E35K, E37K, D38V, Y50F, N51S, M62V, K68E, F72V, Y83H, E329G, G352V, D355N, Q388L, P389H, D509Y | Decreased S protein affinity: [ |
| [ | Q24, K31, H34, E35, D38, Y41, Q42, N53, L79, M82, Y83, N90, N322, Q325, E329, N330, K353, R652, R710 | I21V, E23K, K26R, T27A, N64K, T92I, Q102P, D206G, G211R, R219C, G326E, K341R, H378R, V447F, I468V, A501T, R559S | Increased S protein affinity: [ | |
| [ | S19, Q24, T27, F28, K31, H34, E35, E37, D38, Y41, Q42, L45, L79, M82, Y83, E329, N330, K353, G354, D355, R357 | E300Ter, A627V, N638S, L656Ter, S692P, N720D, L731I/F, E668L | Affinity not reported [ | |
| [ | S19, Q24, T27, F28, D30, K31, H34, E35, E37, D38, Y41, Q42, L45, | M82I, E329G, D355N, R652K, R710C, R710H: No other significant polymorphisms found | from gnomAD | |
| [ | Q24, F28, D30, K31, H34, E35, E37, D38, Y41, Q42, L79, M82, Y83, K353, G354, D355, R357 | S19P (increases affinity), K26R (decreases affinity) | [ | |
| [ | Q24, D30, H34, E37, D38, Y41, Q42, M82, Y83, K353, D355, R357 | S19P, I21V, E23K, K26R, T27A, N64K, T92I, Q102P, H378R | Increase [ | |
| [ | E23, Q24, K26, T27, D30, K31, H34, E35, E37, D38, Y41, Q42, K68, L79, M82, Y83, D206, G211, R219, K317, G326, E329, K341, G352, K353, D355, R357, P389, V447, I468, R559, D442, N437, T478, F486, | K31R, N33I, H34R, E35K, E37K, D38V, Y50F, N51S, M62V, K68E, F72V, Y83H, G326E, G352V, D355N, Q388L, D509Y | Decrease [ | |
| [ | 353-KGDFR-357 | E37K, G352V, D355N (38 others mentioned) | Decrease [ | |
| [ | R and K residues within 697–716 needed for cleavage | T27R, G326E | Increase [ | |
| S19P, E329G | Decrease [ | |||
| K26R (decrease). Increasing affinity in order: I468V, R219C, K341R, D206G, G211R | [ | |||
| TMPRSS2 | [ | D435 | V160M, G181R, R240C, G259S, P335L, G432A, D435Y | [ |
| [ | G190, N192, P191, F195, Y189, S234, K399, D396, N395, T324, A280, C278, R277, F251, I279, F394, Y232, N284, V283, Q290, L285, P325, N488, S287, S288, I489, N286, T393 | G8V, R255S, S441G | Decreased S protein affinity: [ | |
| V197M/V160M, A65T/A28T | Increased S protein affinity: [ | |||
| G190R, P191Q, Y189C, S234G, D396N, T324N, A280D, R277H, R277P, R277C, I279T, I279V, F394S, N284K, N284S, P325T, I489T, N286Y, | from gnomAD | |||
| Neuropilin-1 | [ | Y297, W301, T316, D320, S346, E348, T249, Y353 | T249S (rare), V179A, R563Q, D592N (non-canonical), V733I | from gnomAD |
| No polymorphisms matching other highlighted amino acids | ||||
| Furin | [ | H194, W254, N295, S368 (b-strand at S253-P256), T309, S316 | T309I (rare) | from gnomAD |
| No polymorphisms matching other highlighted amino acids |
Figure 1Amino acid sequence alignment of ACE2 polypeptides from a range of animal species: a short section encompassing two of the glycosylation sites is shown here. Full sequence alignments are shown in Supplementary Figure S1. Accession Numbers of the polypeptide sequences used are given in Table 1. Alignments were carried out using Clustal Omega [45]. Human sequence is highlighted in grey to aid comparison to other sequences. Important amino acids used for interaction with the SARS-CoV-2 viral proteins as reported by Damas et al. [41] (Table 2) are highlighted in yellow, with the highlighted glycosylated residues in blue.
Figure 2The alignment of the cleavage site of ACE2 (region 697–716). Human cleavage site is highlighted in red. Significant arginine and lysine residues are in blue with differences in animal species shown in brown. Grey highlights indicate human sequence.
Figure 3Amino acid sequence alignment of TMPRSS2 polypeptides from a range of animal species: a short section is shown here, whilst the full sequence alignments are shown in Supplementary Figure S2. Accession Numbers of the polypeptide sequences used are given in Table 1. Alignments were carried out using Clustal Omega [45]. Human sequence is highlighted in grey to aid comparison to other sequences. Polymorphisms reported by Hou et al. [20] are in purple. Amino acids involved in the TMPRSS2/SARS-CoV-2 interactions are highlighted in yellow [49].
Figure 4Alignment of human and ferret TMPRSS2 sequences, including the R277-Q290 region. (where * indicates consensus, : indicates conserved change, . is less conserved and a gap in analysis indicates no conservation, as deemed from amino acid characteristics).
Figure 5Amino acid sequence alignment of neuropilin-1 polypeptides from a range of animal species: a short section is shown here, whilst the full sequence alignments are shown in Supplementary Figure S3. Accession Numbers of the polypeptide sequences used are given in Table 1. Alignments were carried out using Clustal Omega [45]. The last row shown is the consensus analysis given by the algorithm. Human sequence (variant 2) is highlighted in grey to aid comparison to other sequences. Important amino acids used for interaction with the SARS-CoV-2 viral proteins [14] are highlighted in yellow. Amino acid differences from the human sequence in this region are highlighted in blue.
Figure 6Amino acid sequence alignment of a region of the furin polypeptides from a range of animal species: a short section is shown here, whilst the full sequence alignments are shown in Supplementary Figure S4. Accession Numbers of the polypeptide sequences used are given in Table 1. Alignments were carried out using Clustal Omega [45]. The last row shown is the consensus analysis given by the algorithm. Human sequences are highlighted in grey to aid comparison to other sequences. Important amino acids used for catalytic site of the enzyme, as discussed [62] are highlighted in yellow, with the S253-P255 region in green. Differences in sequences from the human sequence are shown in blue.
Figure 7Phylogenetic analysis of the animals using the alignment sequences. Sequences as listed in Table 1 were analysed by Clustal Omega [45] and then relevant sections of the phylogenetic analysis shown here. A: analysis of the ACE2 sequences; B: analysis of the TMPRSS2 sequences.
A summary of pertinent points about the proteins analyzed.
| Protein of Interest | Comments and Conclusions |
|---|---|
| ACE2 |
Key protein for viral interactions Several human polymorphisms identified which alter viral susceptibility, either causing an increase or decrease Sequence similarities between animals and human has been used to try to predict susceptibility of vertebrates No polymorphisms which alter viral susceptibility in humans found to correlate to susceptibility in animals to date Sequence variants do not correlate with reported SARS-CoV-2 susceptibility to infection in animals More work on mustelids would be interesting |
| TMPRSS2 |
Some differences in sequences across animal groups Several human polymorphisms identified which alter viral susceptibility Sequence variants do not correlate with reported susceptibility to SARS-CoV-2 infection in animals |
| Neuropilin-1 |
Little variation across animal species and with humans Limited or no use for predicting animal susceptibility to SARS-CoV-2 |
| Furin |
Little variation across animal species and with humans Limited or no use for predicting animal susceptibility to SARS-CoV-2 |
A summary of the susceptibility predictions for a selected range of animals and how this correlates to real-life reporting to-date.
| Animal Group | Susceptibility Predicted | Real-Life Outcomes Reported/Comments |
|---|---|---|
| Non-human primates | Very high—sequences from several species very similar to humans |
Some concern [ Only one case reported: gorillas at San Diego zoo [ |
| Marine mammals | High |
Some concerns, especially as sea water can be contaminated [ No reports to-date of positive animals |
| Cats | Medium |
Several reports of cats testing positive [ Several big cats testing positive, including tigers, lions, snow leopards and a cougar [ Some concerns that cats can transmit the virus [ |
| Dogs | Medium |
Dogs found to be virus positive [ Low expression of ACE2 in respiratory tract [ No reports of dogs transmitting the virus |
| Mustelids | Low |
Large numbers of animals tested positive Large number of animals culled Can transmit virus back to humans [ Mutation of virus in mustelids First case of a wild animal being virus positive [ |
| Marine mammals | Low |
Predicted using the ACE2 receptor [ No reports of animals testing positive |
| Bats | Low/very low |
37 chiropteran species predicted susceptibility as low or very low [ No cases of bats being SARS-CoV-2 positive to date Experimentally fruit bats ( Focus on bats as being potential original source, but no conclusive evidence to date |
| Birds/reptiles/amphibians/fish | Very low |
No cases of any of these groups being SARS-CoV-2 positive to date |