| Literature DB >> 32289821 |
Xuhua Xia1,2.
Abstract
Wild mammalian species, including bats, constitute the natural reservoir of betacoronavirus (including SARS, MERS, and the deadly SARS-CoV-2). Different hosts or host tissues provide different cellular environments, especially different antiviral and RNA modification activities that can alter RNA modification signatures observed in the viral RNA genome. The zinc finger antiviral protein (ZAP) binds specifically to CpG dinucleotides and recruits other proteins to degrade a variety of viral RNA genomes. Many mammalian RNA viruses have evolved CpG deficiency. Increasing CpG dinucleotides in these low-CpG viral genomes in the presence of ZAP consistently leads to decreased viral replication and virulence. Because ZAP exhibits tissue-specific expression, viruses infecting different tissues are expected to have different CpG signatures, suggesting a means to identify viral tissue-switching events. The author shows that SARS-CoV-2 has the most extreme CpG deficiency in all known betacoronavirus genomes. This suggests that SARS-CoV-2 may have evolved in a new host (or new host tissue) with high ZAP expression. A survey of CpG deficiency in viral genomes identified a virulent canine coronavirus (alphacoronavirus) as possessing the most extreme CpG deficiency, comparable with that observed in SARS-CoV-2. This suggests that the canine tissue infected by the canine coronavirus may provide a cellular environment strongly selecting against CpG. Thus, viral surveys focused on decreasing CpG in viral RNA genomes may provide important clues about the selective environments and viral defenses in the original hosts.Entities:
Keywords: COVID-19; SARS-CoV-2; canine intestine; viral evolution; zinc finger antiviral protein
Mesh:
Substances:
Year: 2020 PMID: 32289821 PMCID: PMC7184484 DOI: 10.1093/molbev/msaa094
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
. 1Different host species of BetaCoV have different combinations of viral genomic GC% and . SARS-CoV-2 and BatCoV RaTG13 are clear outliers with extraordinarily small ICpG values indicative of a host tissue with strong selection against CpG in the viral genome. The first three legends are viral taxonomic names. SARS: SARS CoVs, Rhinolophus bats: BetaCoVs isolated from bats in the genus Rhinolophus, Civet+Ferret-badger: BetaCoVs isolated from civet and ferret badger, Ungulate_CoV: BetaCoVs isolated from ungulates (including bovine and equine CoV as well as porcine hemagglutinating encephalomyelitis virus). Human CoV HKU1 and Human CoV OC43 are two members of the viral species BetaCoV1.
. 2Genomic GC proportion and ICpG for all known BetaCoVs, with a complete genome (≥27,000 nt) and host information. No BetaCoVs from their natural hosts have the genomic ICpG and GC% combination close to SARS-CoV-2 and BatCoV RaTG13. New legends not explained in figure 1 are: MERS: MERS CoV; dromedary camel: BetaCoVs isolated from dromedary camels; Hedgehog CoV, Murine CoV, Rattus: BetaCoVs isolated from hedgehog, mouse, and rats; Rabbit HKU14: BetaCoV HKU14 strains isolated from rabbit; Human enteric CoV and Canine respiratory CoV are taxonomic names; Rhinolophus bat, Vespertilionidae, Rousettus bat, Hipposideros: BetaCoV isolated from bats in the genus Rhinolophus, in the family Vespertilionidae, in the genus Rousettus, and in the genus Hipposideros, respectively.
. 3SARS-CoV-2 may have evolved in mammalian digestive tract. (A) Genomic GC% and ICpG for all alphacoronaviruses with a complete genome (≥27,000 nt) and host information. Only CCoV (intestinal pathogen) genomes have GC% and ICpG combination close to SARS-CoV-2+BatCoV RaTG13. The green arrow points to CCoV (accession KC175339 in the phylogeny) that had been propagated extensively in cell culture before sequencing. (B) Phylogeny from the alignment of all sequenced CCoV genomes, with leaf name in the format of (ACCN: ICpG). Genomes were aligned with MAFFT (Katoh et al. 2009) with the FFT-NS-2 option (more accurate than default). PhyML(Guindon and Gascuel 2003) with the GTR+Γ substitution model and “best” option were used to search for the best tree. (C) Tissue-specific gene expression of ACE2, with data from Fagerberg et al. (2014).