| Literature DB >> 29355607 |
Stefania Leopardi1, Edward C Holmes2, Michele Gastaldelli3, Luca Tassoni3, Pamela Priori4, Dino Scaravelli4, Gianpiero Zamperin3, Paola De Benedictis3.
Abstract
Coronaviruses (CoVs) have been documented in almost every species of bat sampled. Bat CoVs exhibit both extensive genetic diversity and a broad geographic range, indicative of a long-standing host association. Despite this, the respective roles of long-term virus-host co-divergence and cross-species transmission (host-jumping) in the evolution of bat coronaviruses are unclear. Using a phylogenetic approach we provide evidence that CoV diversity in bats is shaped by both species richness and their geographical distribution, and that CoVs exhibit clustering at the level of bat genera, with these genus-specific clusters largely associated with distinct CoV species. Co-phylogenetic analyses revealed that cross-species transmission has been more common than co-divergence across coronavirus evolution as a whole, and that cross-species transmission events were more likely between sympatric bat hosts. Notably, however, an analysis of the CoV RNA polymerase phylogeny suggested that many such host-jumps likely resulted in short-term spill-over infections, with little evidence for sustained onward transmission in new co-roosting host species.Entities:
Keywords: Bats; Coronaviruses; Cross-species transmission; Evolution; Phylogeny co-divergence; Virus
Mesh:
Year: 2018 PMID: 29355607 PMCID: PMC7106311 DOI: 10.1016/j.meegid.2018.01.012
Source DB: PubMed Journal: Infect Genet Evol ISSN: 1567-1348 Impact factor: 3.342
Host association, length and classification of RdRp and spike protein CoV sequences used in this study.
| Host superfamily | Host family | Host genus | Host species | RdRp (α; β) | RdRp > 816 bp | Spike (α; β) | RdRp + Spike (α; β) | RdRp clusters (α; β) |
|---|---|---|---|---|---|---|---|---|
| Emballonuroidea | Emballonuridae | 1 | 2 (1; 1) | 0 | 0 | |||
| Molossoidea | Molossidae | 2 | 3 (2; 1) | 3 | 3 (2; 1) | 3 (2; 1) | 0 | |
| Molossoidea | Molossidae | 1 | 2 (2; 0) | 0 | 1 (1; 0) | |||
| Noctilionoidea | Phillostomidae | 2 | 10 (9; 1) | 7 | 2 (2; 0) | |||
| Noctilionoidea | Phillostomidae | 1 | 11 (10; 1) | 8 | 1 (1; 0) | |||
| Noctilionoidea | Phillostomidae | 1 | 3 (3; 0) | 0 | 1 (1; 0) | |||
| Pteropodidae | Pteropodidae | 2 | 13 (0; 13) | 0 | 1 (0; 1) | |||
| Pteropodidae | Pteropodidae | 1 | 3 (0; 3) | 0 | 1 (0; 1) | |||
| Pteropodidae | Pteropodidae | 1 | 60 (1; 59) | 1 | 1 (0;1) | 1 (0;1) | 1 (0; 1) | |
| Pteropodidae | Pteropodidae | 1 | 4 (0; 4) | 0 | 0 | |||
| Pteropodidae | Pteropodidae | 1 | 5 (2; 3) | 0 | 1 (0; 1) | |||
| Pteropodidae | Pteropodidae | 1 | 6 (0; 6) | 0 | 1 (0; 1) | |||
| Pteropodidae | Pteropodidae | 2 | 18 (3; 15) | 14 | 13 (2; 11) | 12 (2; 10) | 2 (0; 2) | |
| Rhinolophoidea | Hipposideridae | 10 | 65 (55; 10) | 45 | 21 (19; 2) | 12 (10;2) | 3 (2; 1) | |
| Rhinolophoidea | Hipposideridae | 1 | 4 (4; 0) | 4 | 2 (2; 0) | |||
| Rhinolophoidea | Nycteridae | 1 | 3 (0; 3) | 3 | 1 (0; 1) | 1 (0; 1) | 1 (0; 1) | |
| Rhinolophoidea | Rhinolophidae | 13 | 87 (24; 63) | 61 | 74 (8; 66) | 29 (4; 25) | 3 (2;1) | |
| Vespertilionoidea | Miniopteridae | 7 | 84 (84; 0) | 34 | 30 (30; 0) | 14 (14;0) | 2 (2; 0) | |
| Vespertilionoidea | Vespertilionidae | 3 | 14 (3; 11) | 1 | 2 (1; 1) | |||
| Vespertilionoidea | Vespertilionidae | 1 | 6 (6; 0) | 6 | 2 (1; 1) | 0 | 1 (1; 0) | |
| Vespertilionoidea | Vespertilionidae | 16 | 80 (77; 3) | 8 | 6 (6; 0) | 5 (5; 0) | 8 (8; 0) | |
| Vespertilionoidea | Vespertilionidae | 3 | 7 (6; 1) | 2 | 1 (1;0) | 0 | 1 (1; 0) | |
| Vespertilionoidea | Vespertilionidae | 5 | 37 (9; 28) | 20 | 22 (0; 22) | 12 (0; 12) | 6 (3; 3) | |
| Vespertilionoidea | Vespertilionidae | 2 | 7 (4; 3) | 1 | 3 (3; 0) | 1 (1; 0) | 1 (1; 0) | |
| Vespertilionoidea | Vespertilionidae | 1 | 7 (0; 7) | 7 | 22 (0; 22) | 4 (0; 4) | 1 (0; 1) |
Fig. 1Host association and geographical distribution of the CoV sequences analyzed here. Countries within large-scale geographical regions are colored according to the number of CoV analyzed. Pie charts indicate the host-association of the CoV sequences included within each geographical area, with the colors indicating the different families of bat hosts. The map was built using mapchart (https://mapchart.net).
Fig. 2Phylogenetic overview of CoV sequences analyzed here. The tree reflects a Bayesian analysis of 935 bp of the RdRp gene (data set RdRp_CoV_1), rooted using two sequences from gamma coronaviruses (GenBank accession numbers EF584911-2). Genus specific clusters identified in our study are colored based on the host genus, as indicated. Posterior probabilities >0.90 supporting each cluster are shown. Branch lengths are scaled according to the number of substitutions per site. The three bars around the tree show the frequency within each cluster of (i) host genera, (ii) host species and (iii) sampling locations, from the innermost to the most exterior. Sequences showing characters with frequency < 10%, between 10 and 50%, and >50% are colored black, grey and light grey, respectively. For the “host species” bar, only sequences belonging to the host genus characterizing the cluster (frequency > 50%) have been colored; sequences associated with hosts only characterized at the genus level are indicated in yellow. The ICTV classification of virus clusters is indicated when available. The figure was generated using iTOL.
Summary of CoV sequences included in RdRp genus-specific phylogenetic clusters. For each cluster, the table indicates the best represented host genus (>50%), host family, number of host species, sampling location, length of the longest RdRp sequence and the presence of a corresponding cluster in the spike protein sequences.
| Cluster | Host genus | Host family | Host species | Sampling location | RdRp max length | Spike protein |
|---|---|---|---|---|---|---|
| C1α | Phillostomidae | 1 | LAM | 393 bp | ||
| C2α | Rhinolophidae | 2 | AS, SEA | 935 bp | X (AS) | |
| C3α | Phillostomidae | 1 | LAM | 935 bp | ||
| C4α | 2 | LAM | 816 bp | |||
| C5α | 2 | LAM | 816 bp | |||
| C6α | Molossidae | 1 | LAM | 393 bp | ||
| C7α | Vespertilionidae | 4 | AS, EU | 935 bp | X (AS) | |
| C8α | Rhinolophidae | 3 | AFR, EU | 816 bp | ||
| C9α | Hipposideridae | 4 | AS, SEA | 935 bp | X (AS) | |
| C10α | Miniopteridae | 7 | AS, AUS, EU, SEA | 935 bp | X (AS) | |
| C11α | 6 | AFR, AS, SEA | 935 bp | X (AFR, AS) | ||
| C12α | Vespertilionidae | 1 | EU | 412 bp | ||
| C13α | 1 | AS | 408 bp | |||
| C14α | 2 | EU | 817 bp | |||
| C15α | 1 | AS | 935 bp | |||
| C16α | 2 | AS-SEA | 935 bp | X (AS) | ||
| C17α | 2 | AS | 415 bp | |||
| C18α | 3 | AS, EU | 935 bp | X (AS) | ||
| C19α | 1 | EU | 392 bp | |||
| C20α | 1 | EU | 816 bp | |||
| C21α | 1 | EU | 403 bp | |||
| C22α | 1 | EU | 403 bp | |||
| C23α | 2 | EU | 403 bp | |||
| C24α | 2 | LAM | 393 bp | |||
| C25α | 2 | NAM | 935 bp | X | ||
| C26α | Hipposideridae | 1 | AFR | 935 bp | ||
| C27α | 1 | AFR | 935 bp | |||
| C28α | 4 | AFR | 816 bp | X | ||
| C29β | Pteropodidae | 1 | AFR | 805 bp | ||
| C30β | 1 | AS | 935 bp | X | ||
| C31β | 2 | SEA | 422 bp | |||
| C32β | 1 | SEA | 394 bp | |||
| C33β | 2 | AFR, AS | 935 bp | X | ||
| C34β | 1 | AFR | 416 bp | |||
| C35β | 1 | AFR | 935 bp | X | ||
| C36β | Rhinolophidae | 10 | AFR, AS, EU | 935 bp | X (AS, EU) | |
| C37β | Hipposideridae | 5 | AFR, AS, SEA | 935 bp | X (AFR, AS) | |
| C38β | Nycteridae | 1 | AFR | 816 bp | X | |
| C39β | Vespertilionidae | 1 | AS | 935 bp | X | |
| C40β | 2 | AS | 935 bp | X | ||
| C41β | 1 | EU | 392 bp | |||
| C42β | 1 | AS | 408 bp | |||
| C43β | 2 | EU | 903 bp | |||
| C44β | 2 | AS, EU | 895 bp |
Sampling locations are indicated according to their large-scale geographic area, comprising Europe (EU), Africa (AFR), North America (NAM), Latin America (LAM) (Central and South America), Asia (AS), South East Asia (SEA), and Australia (AUS).
Indicates the presence of a spike protein sequence for one or more of the RdRp sequences included within the cluster; the sampling macro-area is indicated in brackets.
Amino acid diversity within and between clusters based on the RdRp and the spike protein, expressed as percentages (with SE). Spike protein data are only shown sequences for whose clusters correspond with those obtained from the RdRp.
| Cluster | Host genus | Mean within cluster amino acid divergence | Mean amino acid divergence from the closest group | ||
|---|---|---|---|---|---|
| RdRp | Spike | RdRp | Spike | ||
| C1α | 0.62 (0.005) | ||||
| C2α | 17.2 (0.023) – C28α | 60.97 (0.015) – Suncus_α | |||
| C3α | 13.3 (0.019) – C4α | ||||
| C4α | 13.3 (0.019) – C3α | ||||
| C5α | 12.9 (0.018) – C8α | ||||
| C6α | 0 (0) | ||||
| C7α | 11.72 (0.007) | 17 (0.019) – PEDV | 40.22 (0.01) – C10α | ||
| C8α | 7.8 (0.017) – C9α | ||||
| C9α | 19.42 (0.007) | 7.8 (0.017) – C8α | 47.45 (0.013) – C7α | ||
| C10α | 31.45 (0.009) | 6.8 (0.013) – C11α | 40.22 (0.01) – C7α | ||
| C11α | 20.59 (0.007) | 6.8 (0.013) – C10α | 41.45 (0.012) – C7α | ||
| C12α | 0 (0) | ||||
| C13α | 0 (0) | ||||
| C14α | 0 (0) | ||||
| C15α | 7.4 (0.018) – PEDV | ||||
| C16α | 3.73 (0.015) | 8.5 (0.018) – C18α | 37.81 (0.012) – PEDV | ||
| C17α | 1.86 (0.005) | ||||
| C18α | 2.60 (0.008) | 4.1 (0.017) – C20α | 39.64 (0.013) – PEDV | ||
| C19α | 0.86 (0.003) | ||||
| C20α | 0.69 (0.003) | 4.1 (0.017) – C18α | |||
| C21α | 0 (0) | ||||
| C22α | 0 (0) | ||||
| C23α | 0.69 (0.004) | ||||
| C24α | 0 (0) | ||||
| C25α | 0 (0) | 11 (0.02) – PEDV | 46.68 (0.012) – C16α | ||
| C26α | 6.3 (0.014) – NL63 | ||||
| C27α | 9.6 (0.02) – C26α | ||||
| C28α | Hipposideros | 19.07 (0.007) | 1.5 (0.006) – 229E | 17.46 (0.008) – 299E | |
| C29β | 1.52 (0.005) | ||||
| C30β | 1.03 (0.002) | 6.7 (0.014) – C33β | 36.63 (0.011) – C33β | ||
| C31β | 0.60 (0.005) | ||||
| C32β | 0 (0) | ||||
| C33β | 28.59 (0.007) | 6.7 (0.014) – C30β | 36.63 (0.011) – C30β | ||
| C34β | 0 (0) | ||||
| C35β | 0.07 (0.00) | 10.3 (0.015) – C33β | 47.98 (0.013) – C30β | ||
| C36β | 17.72 (0.01) | 4.6 (0.005) – SARSV | 21.96 (0.011) – SARSV | ||
| C37β | 48.29 (0.01) | 17.2 (0.021) – C36β | 61.14 (0.011) - SARSV | ||
| C38β | 6.6 (0.013) – C44β | 44.89 (0.013) – Hedgehog_CoV | |||
| C39β | 0.46 (0.001) | 6.2 (0.016) – C40β | 31.23 (0.013) – C40β | ||
| C40β | 16.3 (0.003) | 5.2 (0.013) – C43β | 31.23 (0.013) – C39β | ||
| C41β | 0 (0) | ||||
| C42β | 0.72 (0.007) | ||||
| C43β | 1.8 (0.007) – MERSV | ||||
| C44β | 3.7 (0.01) – MERSV/C43β | ||||
Distances calculated between RdRp sequences longer than 816 bp are indicated in bold.
Distances are only calculated between clusters containing sequences equal or longer than 816 bp, including those from non-flying mammals.
Clusters compatible with the inclusion within a single RGU.
Frequency of different evolutionary scenarios following co-phylogenetic reconciliation analysis (Jane) of the full set of sequences. Results are shown assuming equal or lower costs for co-divergence compared to the other possible evolutionary events.
| Association tested | Co-divergence | Virus lineage duplication | Host-shift | Virus loss | Failure of virus divergence | Co-divergence vs. other events (%) | |
|---|---|---|---|---|---|---|---|
| Co-divergence cost = other events | Mammals: α-βCoVs | 0 | 10 | 47 | 0 | 0 | 0 |
| Mammals: αCoVs | 0 | 6 | 27 | 0 | 0 | 0 | |
| Mammals: βCoVs | 0 | 2 | 21 | 0 | 0 | 0 | |
| Co-divergence cost < other events | Mammals: α-βCoVs | 11 | 8 | 38 | 3 | 0 | 18.3 |
| Mammals: αCoVs | 5 | 5 | 23 | 1 | 0 | 14.7 | |
| Mammals: βCoVs | 3 | 3 | 17 | 0 | 0 | 13 |
Summary of biological information shared between the donor and recipient CoV hosts.
| CoV donor cluster | Host genus | No. of cross-species transmissions (RdRp) | Availability of corresponding spike sequences | Characteristics shared between recipient and donor hosts | Co-roosting of hosts (documented) | Co-roosting of hosts (potential) | RdRp | Spike | |||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Donor | Recipient | Host family | Host superfamily | Sampling country | CoV divergent from donor cluster | ||||||
| 3α | 2 | na | X | X | – | – | – | X | na | ||
| 7α | 1 | 0 | – | X | X | X | X | – | na | ||
| 8α | 2 | na | – | – | X | – | – | – | na | ||
| 8α | 1 | na | – | – | X | – | X | – | na | ||
| 9α | 1 | 0 | – | – | X | – | – | – | na | ||
| 9α | 1 | 0 | – | – | X | X | X | – | na | ||
| 9α | 2 | 2 | – | – | X | – | X | – | X | ||
| 9α | 1 | 0 | – | – | X | – | X | – | na | ||
| 10α | 1 | 0 | – | – | X | X | X | X | na | ||
| 10α | 1 | 0 | – | – | X | – | X | – | na | ||
| 10α | 1 | 0 | – | X | X | – | – | X | na | ||
| 11α | 1 | 0 | – | – | X | – | – | X | na | ||
| 11α | 1 | 0 | – | X | – | – | – | – | na | ||
| 11α | 3 | 0 | – | – | X | X | X | – | na | ||
| 14α | 1 | na | X | X | X | – | – | – | na | ||
| 18α | 1 | na | – | – | X | – | X | – | na | ||
| 31β | 1 | na | – | – | X | – | – | – | na | ||
| 36β | 1 | 1 | – | – | X | – | X | – | – | ||
| 44β | 1 | na | X | X | X | – | – | X | na | ||
| 44β | 3 | na | X | X | X | – | – | – | na | ||
na: not applicable.
Documented by associated literature.
Potential co-roosting was based on roost sharing of sympatric species, based on information provided by IUCN (http://www.iucnredlist.org consulted on January 2016).