| Literature DB >> 32826334 |
Joana Damas1, Graham M Hughes2, Kathleen C Keough3,4, Corrie A Painter5, Nicole S Persky6, Marco Corbo1, Michael Hiller7,8,9, Klaus-Peter Koepfli10, Andreas R Pfenning11, Huabin Zhao12,13, Diane P Genereux14, Ross Swofford14, Katherine S Pollard4,15,16, Oliver A Ryder17,18, Martin T Nweeia19,20,21, Kerstin Lindblad-Toh14,22, Emma C Teeling2, Elinor K Karlsson14,23,24, Harris A Lewin25,26,27.
Abstract
The novel coronavirus severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the cause of COVID-19. The main receptor of SARS-CoV-2, angiotensin I converting enzyme 2 (ACE2), is now undergoing extensive scrutiny to understand the routes of transmission and sensitivity in different species. Here, we utilized a unique dataset of ACE2 sequences from 410 vertebrate species, including 252 mammals, to study the conservation of ACE2 and its potential to be used as a receptor by SARS-CoV-2. We designed a five-category binding score based on the conservation properties of 25 amino acids important for the binding between ACE2 and the SARS-CoV-2 spike protein. Only mammals fell into the medium to very high categories and only catarrhine primates into the very high category, suggesting that they are at high risk for SARS-CoV-2 infection. We employed a protein structural analysis to qualitatively assess whether amino acid changes at variable residues would be likely to disrupt ACE2/SARS-CoV-2 spike protein binding and found the number of predicted unfavorable changes significantly correlated with the binding score. Extending this analysis to human population data, we found only rare (frequency <0.001) variants in 10/25 binding sites. In addition, we found significant signals of selection and accelerated evolution in the ACE2 coding sequence across all mammals, and specific to the bat lineage. Our results, if confirmed by additional experimental data, may lead to the identification of intermediate host species for SARS-CoV-2, guide the selection of animal models of COVID-19, and assist the conservation of animals both in native habitats and in human care.Entities:
Keywords: ACE2; COVID-19; SARS-CoV-2; comparative genomics; species conservation
Mesh:
Substances:
Year: 2020 PMID: 32826334 PMCID: PMC7486773 DOI: 10.1073/pnas.2010146117
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205
Fig. 1.Cross-species conservation of ACE2 at the known binding residues and predictions of SARS-CoV-2 S-binding propensity. Species are sorted by binding scores. The ID column depicts the number of amino acids identical to human binding residues. Bold amino acid positions (also labeled with asterisks) represent residues at binding hot spots and constrained in the scoring scheme. Each amino acid substitution is colored according to its classification as nonconservative (orange), semiconservative (yellow), or conservative (blue), as compared to the human residue. Bold species names depict species with threatened IUCN risk status. The 410 vertebrate species dataset is available in Dataset S1.
Fig. 2.Cross-species conservation of ACE2 at the known binding residues and predictions of SARS-CoV-2 S-binding propensity. Species are sorted by binding scores. The ID column depicts the number of amino acids identical to human binding residues. Bold amino acid positions (also labeled with asterisks) represent residues at binding hot spots and constrained in the scoring scheme. Each amino acid substitution is colored according to its classification as nonconservative (orange), semiconservative (yellow), or conservative (blue), as compared to the human residue. Bold species names depict species with threatened IUCN risk status. The 410 vertebrate species dataset is available in Dataset S1.
Fig. 3.Congruence between binding score and the structural homology analysis. Species predicted with very high (red) or high binding scores (orange) have significantly fewer amino acid substitutions rated as potentially altering the binding interface between ACE2 and SARS-CoV-2 using protein structural analysis when compared to species with low (green) or very low (blue) binding scores. The more severe unfavorable variants are counted on the y axis and less severe weaken variants on the x axis. Black numerical labels indicate species count.
Fig. 4.Residues at the binding interface between ACE2 and SARS-CoV-2 S are under positive selection (CodeML analysis). In the SARS-CoV-2 spike protein RBD (light teal), this includes three positively selected residues (green, labeled with two asterisks). In ACE2 (wheat-colored, with binding interface residues in yellow), selected residues occur both outside the binding interface (dark blue) and inside the binding interface (red, labeled with one asterisk). (A) Positively selected residues in all mammals, including two at the binding interface. (B) A with 90° rotation. (C) Positively selected residues in the Chiroptera lineage, including five at the binding interface. (D) C with 90° rotation.