| Literature DB >> 22514697 |
Herman Tse1, Wan-Mui Chan, Kenneth S M Li, Susanna K P Lau, Patrick C Y Woo, Kwok-Yung Yuen.
Abstract
Sapovirus is a genus of caliciviruses that are known to cause enteric disease in humans and animals. There is considerable genetic diversity among the sapoviruses, which are classified into different genogroups based on phylogenetic analysis of the full-length capsid protein sequence. While several mammalian species, including humans, pigs, minks, and dogs, have been identified as animal hosts for sapoviruses, there were no reports of sapoviruses in bats in spite of their biological diversity. In this report, we present the results of a targeted surveillance study in different bat species in Hong Kong. Five of the 321 specimens from the bat species, Hipposideros pomona, were found to be positive for sapoviruses by RT-PCR. Complete or nearly full-length genome sequences of approximately 7.7 kb in length were obtained for three strains, which showed similar organization of the genome compared to other sapoviruses. Interestingly, they possess many genomic features atypical of most sapoviruses, like high G+C content and minimal CpG suppression. Phylogenetic analysis of the viral proteins suggested that the bat sapovirus descended from an ancestral sapovirus lineage and is most closely related to the porcine sapoviruses. Codon usage analysis showed that the bat sapovirus genome has greater codon usage bias relative to other sapovirus genomes. In summary, we report the discovery and genomic characterization of the first bat calicivirus, which appears to have evolved under different conditions after early divergence from other sapovirus lineages.Entities:
Mesh:
Year: 2012 PMID: 22514697 PMCID: PMC3325917 DOI: 10.1371/journal.pone.0034987
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Species distribution of specimens and RT-PCR surveillance results in the present study.
| Bat species | Number of animals screened by RT-PCR | Number (%) of animals with positive detection of SaV |
|
| 14 | - |
|
| 103 | - |
|
| 78 | - |
|
| 18 | - |
|
| 65 | - |
|
| 14 | - |
|
| 321 | 5 (1.56%) |
|
| 9 | - |
|
| 84 | - |
|
| 1 | - |
|
| 2 | - |
|
| 1 | - |
|
| 9 | - |
|
| 9 | - |
Comparison of genomic features among selected SaV.
| Virus | Genogroup | Genome length (nt) | Genomic G+C content (mol%) | ORF1 length (nt) | ORF2 length (nt) | 3′-UTR length (nt) | ORF2 reading frame relative to ORF1 | Presence of putative ORF3 |
| Bat SaV/TLC58/HK | N/A | 7696 | 60.2 | 6855 | 615 | 225 | +1 | Y |
| SaV/Manchester | I | 7431 | 50.7 | 6843 | 498 | 82 | −1 | Y |
| SaV/Mc10 | II | 7548 | 51.5 | 6837 | 501 | 108 | −1 | N |
| Porcine enteric calicivirus | III | 7320 | 53.5 | 6765 | 495 | 55 | −1 | N |
| SaV/Hu/Chiba/000671/1999/JP | IV | 7420 | 53.5 | 6816 | 504 | 91 | −1 | Y |
| SaV/Hu/Ehime475/2004/JP | V | 7500 | 49.8 | 6906 | 501 | 83 | −1 | Y |
| Canine SaV | N/A | Incomplete | Incomplete | Incomplete | 501 | 141 | +1 | N |
| California sea lion 1 SaV | N/A | Incomplete | Incomplete | Incomplete | 504 | Incomplete | −1 | N |
Only the formally recognized SaV genogroups are included.
Complete genome sequence is not available for canine SaV and California sea lion 1 SaV.
Sequence accession numbers are as follows: Bat SaV/TLC58/HK (JN899075), SaV/Manchester (X86560), SaV/Mc10 (NC_010624), porcine enteric calicivirus (PEC) (AF182760), SaV/Hu/Chiba/000671/1999/JP (AJ786349), SaV/Hu/Ehime475/2004/JP (DQ366344), canine SaV (JN387134), California sea lion 1 SaV (JN420370).
Figure 1Genome organization of the bat SaV TLC58/HK.
The genome organization of the bat SaV TLC58/HK in comparison with the genome organization of human SaV GI strain Mancheseter, human SaV GII strain Mc10, porcine enteric calicivirus, and norovirus GII strain MD145.
Comparison of genome identities and amino acid identities between the predicted polyproteins of bat SaV and the selected SaV.
| Viruses | Genogroup | Bat SaV/TLC34/HK | Bat SaV/TLC39/HK | Bat SaV/TLC58/HK | ||||||||||||
| Pairwise nt identity (%) | Pairwise amino acid identity (%) | Pairwise nt identity (%) | Pairwise amino acid identity (%) | Pairwise nt identity (%) | Pairwise amino acid identity (%) | |||||||||||
| Complete genome | ORF1 | Pro-Pol | VP1 | VP2 | Complete genome | ORF1 | Pro-Pol | VP1 | VP2 | Complete genome | ORF1 | Pro-Pol | VP1 | VP2 | ||
| SaV/Manchester | GI | 44.7 | 36.3 | 47.2 | 36.3 | 16.9 | 44.8 | 36.4 | 47.5 | 35.8 | 16.9 | 44.8 | 36.4 | 47.5 | 36.1 | 16.9 |
| SaV/Mc10 | GII | 45.2 | 37.1 | 48.1 | 37.5 | 17.4 | 45.2 | 37.1 | 48.2 | 37.5 | 17.4 | 45.2 | 37.1 | 48.2 | 37.3 | 17.4 |
| PEC | GIII | 45.3 | 36.5 | 45.4 | 36.3 | 15.5 | 45.3 | 36.7 | 45.7 | 36.6 | 15.5 | 45.3 | 36.6 | 45.7 | 36.3 | 15.5 |
| SaV/Chiba/000671 | GIV | 46.1 | 37.3 | 48.2 | 38.5 | 19.9 | 46.0 | 37.4 | 48.4 | 38.2 | 19.9 | 46.0 | 37.4 | 48.4 | 38.3 | 19.9 |
| SaV/Ehime475 | GV | 44.2 | 35.9 | 45.9 | 36.9 | 19.4 | 44.2 | 36.0 | 46.1 | 37.1 | 19.4 | 44.2 | 36.0 | 46.1 | 36.9 | 19.4 |
| Canine SaV | N/A | N/A | N/A | 46.7 | 39.3 | 16.5 | N/A | N/A | 47.0 | 39.2 | 16.5 | N/A | N/A | 47.0 | 39.2 | 16.5 |
| California sea lion 1 SaV | N/A | N/A | N/A | 45.2 | 38.9 | 17.9 | N/A | N/A | 45.3 | 38.9 | 17.9 | N/A | N/A | 45.3 | 38.9 | 17.9 |
| Bat SaV/TLC34/HK | N/A | - | - | - | - | - | 99.5 | 99.3 | 99.7 | 98.5 | 100 | 99.6 | 99.6 | 99.7 | 99.2 | 100 |
| Bat SaV/TLC39/HK | N/A | 99.5 | 99.3 | 99.7 | 98.5 | 100 | - | - | - | - | - | 99.8 | 99.6 | 100 | 99.0 | 100 |
| Bat SaV/TLC58/HK | N/A | 99.6 | 99.6 | 99.7 | 99.2 | 100 | 99.8 | 99.6 | 100 | 99.0 | 100 | - | - | - | - | - |
Pro-Pol: Protease - Polymerase.
Figure 2Unrooted maximum-likelihood tree based on full-length amino acid sequences of ORF1 precusor polyprotein.
SH-like aLRT branch support values of greater than 0.70 are shown besides major branches. Scale bar indicates the number of inferred substitutions per site.
Figure 3Unrooted maximum-likelihood trees of VP1 and VP2.
The trees were constructed based on the full-length amino acid sequences of (a) VP1 major capsid protein, and (b) VP2 minor structural protein. SH-like aLRT branch support values of greater than 0.70 are shown besides major branches. Scale bar indicates the number of inferred substitutions per site.
Figure 4Scatterplot of codon usage.
The scratterplot of codon usage summary statistics Nc and Nc′ against the proportion of G or C nucleotides at the 3rd position of synonymous codons (GC3s), showing greater codon usage bias in the bat SaV genome relative to other SaV genomes. Unlike the porcine enteric caliciviruses, the observed difference in codon usage bias persists with adjustment of background nucleotide composition (Nc′).
CpG dinucleotide bias in selected SaV genomes, as assessed by the odds ratio of CpG (ρCG) and other measures.
| Virus | SaV genogroup | ρCG | ρGC | CpG/GpC ratio | GC skew |
| Bat SaV/TLC58/HK | N/A | 0.792 | 1.017 | 0.778 | −0.011 |
| SaV/Manchester | I | 0.530 | 0.954 | 0.556 | −0.017 |
| SaV/Mc10 | II | 0.587 | 0.938 | 0.626 | −0.032 |
| Porcine enteric calicivirus | III | 0.520 | 1.016 | 0.511 | −0.016 |
| SaV/Hu/Chiba/000671/1999/JP | IV | 0.665 | 0.976 | 0.681 | −0.045 |
| SaV/Hu/Ehime475/2004/JP | V | 0.490 | 0.968 | 0.507 | −0.036 |
Significantly less CpG suppression was found in the bat SaV genome, while a similar degree of negative GC skew was observed in all SaV genomes.