Literature DB >> 35467426

Epidemiology and Genomic Characterization of Two Novel SARS-Related Coronaviruses in Horseshoe Bats from Guangdong, China.

Linmiao Li1,2, Libiao Zhang2, Jiabin Zhou2, Xiangyang He2, Yepin Yu2, Ping Liu2, Wenzhong Huang2, Zuofu Xiang3, Jinping Chen2.   

Abstract

Severe acute respiratory syndrome (SARS) coronavirus (SARS-CoV) and SARS-CoV-2, the causative agents of SARS, which broke out in 2003, and coronavirus disease 2019 (COVID-2019), which broke out in 2019, probably originated in Rhinolophus sinicus and R. affinis, respectively. Rhinolophus bats are important hosts for coronaviruses. Many SARS-related coronaviruses (SARSr-CoVs) have been detected in bats from different areas of China; however, the diversity of bat SARSr-CoVs is increasing, and their transmission mechanisms have attracted much attention. Here, we report the findings of SARSr-CoVs in R. sinicus and R. affinis from South China from 2008 to 2021. The full-length genome sequences of the two novel SARSr-CoVs obtained from Guangdong shared 83 to 88% and 71 to 72% nucleotide identities with human SARS-CoV and SARS-CoV-2, respectively, while sharing high similarity with human SARS-CoV in hypervariable open reading frame 8 (ORF8). Significant recombination occurred between the two novel SARSr-CoVs. Phylogenetic analysis showed that the two novel bat SARSr-CoVs from Guangdong were more distant than the bat SARSr-CoVs from Yunnan to human SARS-CoV. We found that transmission in bats contributes more to virus diversity than time. Although our results of the sequence analysis of the receptor-binding motif (RBM) and the expression pattern of angiotensin-converting enzyme 2 (ACE2) inferred that these viruses could not directly infect humans, risks still exist after some unpredictable mutations. Thus, this study increased our understanding of the genetic diversity and transmission of SARSr-CoVs carried by bats in the field. IMPORTANCE Severe acute respiratory syndrome coronavirus (SARS-CoV) and SARS-CoV-2 probably originated from the SARS-related coronaviruses (SARSr-CoVs) carried by Rhinolophus bats from Yunnan, China. Systematic investigations of the reservoir hosts carrying SARSr-CoVs in Guangdong and the reservoir distribution and transmission are urgently needed to prevent future outbreaks. Here, we detected SARSr-CoV in Rhinolophus bat samples from Guangdong in 2009 and 2021 and found that the transmission of SARSr-CoV from different host populations contributes more to increased virus diversity than time. Bat SARSr-CoVs in Guangdong had genetic diversity, and Guangdong was also the hot spot for SARSr-CoVs. We once again prove that R. sinicus plays an important role in the maintenance of the SARS-CoVs. Besides, the SARSr-CoVs are mainly transmitted through the intestines in bats, and these SARSr-CoVs found in Guangdong could not use human ACE2 (hACE2), but whether they can pass through intermediate hosts or directly infect humans requires further research. Our findings demonstrate the ability of SARSr-CoVs to spread across species.

Entities:  

Keywords:  SARS-related coronavirus; epidemiology; genetic diversity; horseshoe bats

Mesh:

Substances:

Year:  2022        PMID: 35467426      PMCID: PMC9239062          DOI: 10.1128/mbio.00463-22

Source DB:  PubMed          Journal:  mBio            Impact factor:   7.786


INTRODUCTION

Emerging infectious diseases (EIDs) pose a great threat to global public health. Approximately 60% to 80% of human EIDs originate from wildlife, and bats may be natural reservoirs for a large variety of viruses, including many important zoonotic viruses that cause severe diseases in humans and domestic animals, especially severe acute respiratory syndrome (SARS) coronavirus (SARS-CoV), Middle East respiratory syndrome coronavirus (MERS-CoV), and SARS-CoV-2, which caused outbreaks in various countries (1–4). These three coronaviruses spread across species through one or more intermediate hosts to achieve zoonotic spillover, during which viral mutation, recombination, and/or amplification may occur (5–7). It is worth noting that horseshoe bats (Rhinolophidae) are thought to be the natural hosts of SARS-CoV and SARS-CoV-2. Some research also demonstrated that Rhinolophus bats may be important hosts for coronaviruses (8, 9). Coronavirus diversity in bats is thought to be shaped by both species richness and geographic distribution. In China, horseshoe bat species are widely distributed and are also the most frequent SARS-related coronavirus (SARSr-CoV) carriers (7, 10–12), particularly Rhinolophus sinicus and R. affinis. Bats in Yunnan have been found to carry the most SARSr-CoVs, some of which have been successfully isolated and proven to use human angiotensin-converting enzyme 2 (hACE2) as the receptor for viral internalization (13–15). Through whole-genome sequence analysis, the origin and evolution of the virus can be better traced. Besides, the complete genomic sequences of SARSr-CoVs have been obtained from Rhinolophus bats in some provinces, so the full-length genome sequence of the SARSr-CoV strain carried by bats from Guangdong is urgently needed for virus traceability. South China, especially Guangdong province, is rich in wildlife. There are more than 60 species of bats in Guangdong. Although SARS-CoV was first reported in humans in Guangdong and caused fatal respiratory infections in close to 800 people worldwide in 2002, subsequent investigations have identified horseshoe bats (genus Rhinolophus) from Yunnan as the natural reservoirs of SARS-CoV (13). In 2016, swine acute diarrhea syndrome coronavirus (SADS-CoV) caused the death of over 25,000 pigs in farms within Guangdong province. This virus originated from Rhinolophus species bats (16). As a natural reservoir of viruses, bats harbor numerous other SARSr-CoVs that could potentially infect humans around the world, causing a SARS- or coronavirus disease 2019 (COVID-19)-like pandemic in the future. Thus, it is necessary to discover more virus diversity to avoid future outbreaks. South China was also a hub for illegal wildlife trade; therefore, it is very necessary to investigate and understand the diversity and transmission of coronaviruses carried by Rhinolophus species bats from South China to prevent a public health event.

RESULTS

Epidemiology of SARSr-CoVs in Rhinolophus bats from South China.

A total of 389 anal swabs from 201 R. sinicus, 177 R. affinis, and 11 Hipposideros pomona bats were collected in roost caves in Guangdong (including Guangzhou, Huizhou, and Shenzhen) and Hainan (see Table S1 in the supplemental material), over the 12-year study period. These specimens were tested for the presence of betacoronaviruses by Reverse Transcription-Polymerase Chain Reaction (RT-PCR) targeting the 360-nucleotide (nt) S gene fragment. In total, 22 samples from R. sinicus (17 individuals from Guangzhou and 5 individuals from Huizhou near Guangzhou) and 6 samples from R. affinis (5 individuals from Guangzhou and 1 from Huizhou) tested positive for betacoronaviruses (Table 1; Table S1). Sequencing of the PCR amplicons revealed that the 28 sequences were bat SARSr-CoVs closely related to strains from Guangdong, Jiangxi, Fujian, and Hongkong (Fig. S1).
TABLE 1

Results of samples for CoV testing by RT-PCR

LocationaDate (mo and yr) Rhinolophus sinicus
Rhinolophus affinis
Hipposideros pomona
No. of samplesNo. of betacoronavirus-positive samplesPositivity rate (%)No. of samplesNo. of betacoronavirus-positive samplesPositivity rate (%)No. of samplesNo. of betacoronavirus-positive samplesPositivity rate (%)
GuangzhouNovember 200911218.18000000
April 202041512.201500000
September 202034514.711230.25000
December 2020000.00000800
March 202131516.134124.88300
May 2021300.001700000
 
HuizhouOctober 2012200.00000000
May and June 20132827.141815.56000
May and June 20203266.676100000
August 20204125.00000000
 
ShenzhenSeptember 20201500.001300000
 
HainanJuly 20082900.00000000
 
 
Total2012210.9517763.391100

Guangzhou, Huizhou, and Shenzhen are the cities in Guangdong.

Results of samples for CoV testing by RT-PCR Guangzhou, Huizhou, and Shenzhen are the cities in Guangdong. Phylogenetic tree based on nucleotide sequences obtained by RT-PCR using ZH2 primers. The tree was constructed by the maximum likelihood method using the MrBayes approach employing the GTR+I+G nucleotide substitution model. The red letters represent the SARSr-CoV strains isolated from Rhinolophus sinicus; the blue letters represent the SARSr-CoV strains isolated from R. affinis. Download FIG S1, PDF file, 0.2 MB. Sample information of bats. Download Table S1, PDF file, 0.4 MB. To understand the genetic diversity of these bat SARSr-CoVs, the complete S genes of the SARSr-CoV strains were amplified and sequenced. Due to the low viral loads in some samples, S gene sequences were successfully amplified from 7 samples, including Rs56, Rs67, Rs68, Rs87, RaCH025, RaCH027, and RaCH039 from Guangzhou, and most of the S gene sequences were obtained from 1 sample (Rs200609) from Huizhou in 2020 and 2 samples (Rs150 and Rs183) from Guangzhou in 2009. These S gene sequences showed high similarity among themselves. Interestingly, compared with positive samples (Rs56, Rs67, Rs68, Rs87, RaCH025, RaCH027, and RaCH039) from Guangzhou in 2020, the virus sequences of samples (Rs150 and Rs183) from Guangzhou in 2009 were similar, suggesting that this virus may have existed for a long period and is almost conserved in the Guangzhou population (Fig. S2). However, different from the S gene sequence of SARSr-CoV isolated from bats from Huizhou, we found 17 amino acid (aa) variation sites, including a 1-aa insertion site in the Rs200609 sequence compared with the sequences of Rs56/67/68/87 (Fig. S3), which clearly suggested that SARSr-CoV may be mutated by transmission from different geographical populations. But in the same geographical location, the SARSr-CoV carried by R. sinicus (Rs56/87/67/68) and R. affinis (RaCH025/27/39) also has a 1-aa variation (Fig. S3), suggesting that host switching increases the mutation of the virus. Sequence alignment of SARSr-CoVs from Guangzhou in 2009 and 2020. Download FIG S2, PDF file, 0.3 MB. Amino acid sequence comparison of S genes of SARSr-CoVs from Guangzhou and Huizhou, Guangdong province. Download FIG S3, PDF file, 0.6 MB.

Viral metagenomics.

Viral nucleic acids of Rs56 were deep sequenced, and we then obtained a total of 11.6 Gb of data (13,571,072 clean reads; 150 bp in length). In total, 1,155,340 reads were best matched with viral proteins available in the NCBI NR database. The most widely distributed virus family was Coronaviridae, and the diverse reads related to this family occupied ∼97.92% of the total viral sequence reads (Fig. S4). Contig sequences were then generated by de novo assembly using MEGAHIT version 1.0 (17), generating 10,588 unique contigs. A taxonomic assignment of these contigs was performed based on BLAST analysis. At this stage, 6 contigs were confirmed for RNA virus species (Table 2), 1 of which was a novel SARSr-CoV with a length of 30,146 nt.
TABLE 2

Information on contigs with a high level of sequence similarity and then confirmed to be Coronavirinae

Query IDSubject ID (GenBank accession no.)Identity (%)Alignment length (nt)No. of mismatchesNo. of gap openingsq_start (position)q_end (position)s_start (position)s_end (position)E valueBit scoreTaxonomy
Contig_4536 GQ153542.1 93.3327,7521,845423127978227751041,690SARSr-CoV
Contig_5449 DQ022305.2 98.3332404012236129099288609.60 × 10−114416SARSr-CoV
Contig_3133 KY770859.1 98.2462283112735327732279593.73 × 10−106390Bat CoV
Contig_6085 MK211378.1 98.5782112152273130042298321.13 × 10−97364Bat CoV/YN2018D
Contig_9629 MK211378.1 1001260057770230163300385.61 × 10−57228Bat CoV/YN2018D
Contig_9617 AY536760.3 94.048168623167184187.78 × 10−65253SARSr-CoV

Query ID: query sequence ID; Subject ID: the target sequence ID; Identity (%): the percentage of consistency of sequence alignment; Alignment length: the length of the matched area; No. of mismatches: the number of mismatch bases in the matched area; No. of gap openings: the number of gaps in the matched area; q_ start: the starting point of the comparison region on the query sequence; q_end: the termination point of the matched area on the query sequence; s_start: the starting position of the matching region on the subject sequence; s_end: the termination site of the matched area on the subject sequence; E value: the expected value of the comparison result; Bit score: the bit score value of the comparison result; Taxonomy: Annotation information.

Information on contigs with a high level of sequence similarity and then confirmed to be Coronavirinae Query ID: query sequence ID; Subject ID: the target sequence ID; Identity (%): the percentage of consistency of sequence alignment; Alignment length: the length of the matched area; No. of mismatches: the number of mismatch bases in the matched area; No. of gap openings: the number of gaps in the matched area; q_ start: the starting point of the comparison region on the query sequence; q_end: the termination point of the matched area on the query sequence; s_start: the starting position of the matching region on the subject sequence; s_end: the termination site of the matched area on the subject sequence; E value: the expected value of the comparison result; Bit score: the bit score value of the comparison result; Taxonomy: Annotation information. Virus reads classified at the family level by viral metagenomics. Download FIG S4, PDF file, 0.1 MB.

Genomic characterization of the novel SARSr-CoVs.

The full-length genomes of two novel bat SARSr-CoVs, strains named by abbreviation of the bat species and sample identifier (RaCH025 and Rs56), were 29,622 nt and 30,146 nt, respectively, and the gene structure was similar to those of SARS-CoV and other bat SARSr-CoVs (Fig. 1). Rs56 was assembled by high-throughput sequencing (contig_4536) (Table 2) and verified to be 29,674 nt by RT-PCR with 70 pairs of feasible primers (Table S2A) without the 255 nt at the 5′ terminus and the 217 nt at the 3′ terminus. However, the complete genome of the RaCH025 strain was obtained by RT-PCR with 64 pairs of feasible primers (Table S2B). The genomes of the two SARSr-CoVs in the same cave shared 97.95% nucleotide sequence identity, and the main difference existed in open reading frame 8 (ORF8), with a similarity of only 34.43% (Table 3). The overall nucleotide sequence identities between the two novel SARSr-CoVs and human SARS-CoV and SARS-CoV-2 were 83 to 88% and 71 to 72%, respectively, lower than that observed for bat SARSr-CoVs reported from other locations in China (88 to 96%) (6, 7, 10, 12, 18, 19). Comparatively speaking, Rs56 and RaCH025 had high similarity with 18 strains downloaded from the NCBI database from Guangdong with complete sequences (Table S3), which was examined by Simplot analysis (Fig. 1).
FIG 1

Gene map of the two novel SARSr-CoVs and similarity plot based on the full-length genome sequence of SARS-CoV-SZ3. Full-length genome sequences of Bat-SARSr-CoV-RaCH025, Bat-SARSr-CoV-Rs56, Bat-SARSr-CoV-HKU3-7/HKU3-8, and Bat-SARSr-CoV-GD2016B/2017F/2017G/2017H/2017I/2017J/2017L/2017M/2017N/2017O/2017P/2017Q/2017W/2019A/2019B/2019D were used as reference sequences. The analysis was performed with the Kimura model, with a window size of 1,500 bp and a step size of 150 bp.

TABLE 3

Nucleotide and protein sequence identities among the whole genome and each gene or region of SARS-CoV, SARS-CoV-2, and other bat SARSr-CoVs against Bat-SARSr-CoV-Rs56

Virus nameNucleotide/amino acid sequence identity (%)a
Whole genomeSRBDEMNORF1abRdRpORF3aORF6ORF7aORF7bORF8
Human-SARS-CoV-272.95/81.2065.57/73.6562.05/69.1292.11/9281.23/90.0587.14/89.5075.41/86.2486.27/96.2370.91/72.9969.89/68.8581.15/85.12/32.79/20.66
Human-SARS-CoV-GZ0287.98/90.5172.76/78.8058.20/58.9297.40/10093.54/96.3395.97/96.9191.63/96.0894.28/97.3582.42/85.0493.75/92.0691.87/92.6292.59/93.1897.29/97.54
Civet-SARS-CoV-civet01087.82/90.4272.62/78.5758.90/59.4697.40/10093.09/95.9395.89/96.6791.60/95.9692.76/95.8681.45/83.2193.75/92.0691.87/92.6292.59/93.1897.29/97.54
Bat-SARSr-CoV-RaCH025/Guangdong 97.95/98.4499.73/99.9299.84/10099.57/10097.30/99.0998.34/99.2998.62/99.2899.89/99.7899.88/99.6496.88/98.4198.37/99.18100/10034.43/25.62
Bat-SARSr-CoV-WIVI/Yunnan82.22/88.9073.08/79.3759.61/61.0897.84/10094.14/98.1995.89/96.6792.14/96.2492.94/95.8682.30/85.0492.19/90.4892.14/95.9091.85/95.4531.14/24.79
Bat-SARSr-CoV-YN2013/Yunnan90.62/94.7278.20/85.3280.62/90.2097.83/10092.19/95.0296.29/97.1592.23/96.4194.96/97.7386.06/89.7894.79/88.0091.87/95.0889.63/93.1898.37/98.36
Bat-SARSr-CoV-Rs6272006/Guizhou90.39/91.2083.60/91.7077.78/86.4997.83/10093.84/97.7496.60/97.3992.00/96.0992.97/95.9786.79/89.7893.75/93.6593.77/96.7291.85/95.4531.14/24.79
Bat-SARSr-CoV-GX2013/Guangxi93.53/96.8391.92/97.2680.24/88.1198.70/10094.89/97.2896.52/97.3993.30/96.7392.94/95.8694.19/95.2694.79/92.0696.21/98.3696.29/10098.10/96.72
Bat-SARSr-CoV-HuB2013/Hubei88.92/73.2280.19/88.7274.60/85.4198.70/98.6895.80/97.7496.28/97.3890.16/97.0388.56/95.4490.06/94.1697.40/10090.51/95.9091.11/95.4533.06/22.31
Bat-SARSr-CoV-HKU3-1/Hongkong92.52/96.8494.65/97.8282.01/88.11100/10096.85/97.7497.47/98.1092.17/97.7690.29/96.6096.97/95.9999.48/98.4195.66/97.5498.52/10034.15/24.79
Bat-SARSr-CoV-ZXC21/Zhejiang76.82/86.7872.81/97.8269.66/82.1691.67/92.0080.18/90.0588.73/90.9378.97/86.7590.75/96.6070.91/71.5365.57/69.3577.87/80.9971.21/74.4135.25/20.66

RdRp, RNA-dependent RNA polymerase. RBD, Receptor-Binding Domain. “/” is used to separate the data before and after. Bold in the column of virus name represents the strain obtained in this study.

Gene map of the two novel SARSr-CoVs and similarity plot based on the full-length genome sequence of SARS-CoV-SZ3. Full-length genome sequences of Bat-SARSr-CoV-RaCH025, Bat-SARSr-CoV-Rs56, Bat-SARSr-CoV-HKU3-7/HKU3-8, and Bat-SARSr-CoV-GD2016B/2017F/2017G/2017H/2017I/2017J/2017L/2017M/2017N/2017O/2017P/2017Q/2017W/2019A/2019B/2019D were used as reference sequences. The analysis was performed with the Kimura model, with a window size of 1,500 bp and a step size of 150 bp. Nucleotide and protein sequence identities among the whole genome and each gene or region of SARS-CoV, SARS-CoV-2, and other bat SARSr-CoVs against Bat-SARSr-CoV-Rs56 RdRp, RNA-dependent RNA polymerase. RBD, Receptor-Binding Domain. “/” is used to separate the data before and after. Bold in the column of virus name represents the strain obtained in this study. Primers used to obtain the complete sequence and S gene of bat SARSr-CoV. (A) Primers used to obtain the complete sequence of Bat-SARSr-CoV-Rs56. (B) Primers used to obtain the complete sequence of Bat-SARSr-CoV-RaCH025. (C) Primers used to obtain the S gene of CoVs from positive samples. Download Table S2, PDF file, 0.4 MB. Information on betacoronaviruses in the GenBank database. Download Table S3, PDF file, 0.2 MB.

Evolution analysis of SARSr-CoVs.

The potential recombination events between Rs56, RaCH025, HKU3-1, GX2013, and HuB2013 were examined by RDP4 and Simplot. Evidence of a recombination event was detected in the genome of the novel SARSr-CoV RaCH025; Rs56 and HKU3-1 were suggested to be the major and minor parents of RaCH025, respectively (P value of <10−130), and RaCH025 and HKU3-1 were suggested to be the major and minor parents of Rs56, with a strong P value (<10−100). The results suggested that significant recombination occurred between RaCH025 and Rs56 from bats in the same cave, and Rs56 was likely to be a recombinant strain from three SARSr-CoVs harbored by bats, namely, RaCH025, HKU3-1, and GX2013, with four breakpoints at genome positions 7350, 10500, 27300, and 29100 (Fig. 2A). The major and minor parents of GX2013 were RaCH025 and HKU3-1, with a strong P value (<10−60). However, HKU3-1 was likely to be a recombinant strain from two SARSr-CoVs harbored by bats, namely, Rs56 and HuB2013, with a strong P value (<10−40), and breakpoints were identified at genome positions 8400, 10050, and 27600 (Fig. 2B). The results also revealed that the SARSr-CoVs carried by bats in Guangdong had recombined and genetically mutated with those carried by bats in surrounding areas.
FIG 2

Detection of potential recombination events by similarity plot and bootscan analyses. (A) The full-length genome sequence of SARSr-CoV-Rs56 was used as the query sequence, and the sequences of GX2013, HKU3-1, and RaCH025 were used as the reference sequences. (B) The full-length genome sequence of HKU3-1 was used as the query sequence, and SARSr-CoVs HuB2013, Rs56, and RaCH025 were used as the reference sequences. All analyses were performed with a Kimura model, with a window size of 1,500 bp and a step size of 150 bp.

Detection of potential recombination events by similarity plot and bootscan analyses. (A) The full-length genome sequence of SARSr-CoV-Rs56 was used as the query sequence, and the sequences of GX2013, HKU3-1, and RaCH025 were used as the reference sequences. (B) The full-length genome sequence of HKU3-1 was used as the query sequence, and SARSr-CoVs HuB2013, Rs56, and RaCH025 were used as the reference sequences. All analyses were performed with a Kimura model, with a window size of 1,500 bp and a step size of 150 bp. Phylogenetic trees were constructed using the nucleotides of the complete genome, the ORF1ab gene, the S gene, and the ORF8 gene. Phylogenetic analysis based on the whole genomic sequence showed that the SARSr-CoVs carried by Rhinolophus species bats have genetic diversity and appear to be associated with host geographic distribution. The two novel SARSr-CoVs acquired in this study formed one clade with other strains from Hongkong, Guangdong, Jiangxi, Fujian, and Guangxi, which showed a more distant phylogenetic relationship to SARS-CoV than the bat SARSr-CoVs from Yunnan (Fig. 3A). The tree topology observed in the nonstructural protein gene ORF1ab was not completely consistent with the tree constructed by the whole-genome sequence, but two novel SARSr-CoVs acquired in this study were also more distant from SARS-CoV than the bat SARSr-CoVs from Yunnan (Fig. S5A). Based on the high genetic diversity of the S gene, nearly all bat SARSr-CoVs from Guangzhou were closely clustered; however, the bat SARSr-CoV from Huizhou was clustered into another branch, which showed that geographical differences exist in these CoVs in Guangdong (Fig. 3B). The result showed that the relationship of Rs56 formed in one clade with GX2013, GD2017M/G, and JX2021M/N, but that of RaCH025 formed in another clade in the ORF8 region (Fig. S5B). Especially, Rs56, GX2013, YN2013, GD2017M/G/W, and JX2021M/N/P/O/A/J/K from Chinese horseshoe bats were close to SARS-CoV according to the phylogenetic tree of ORF8, suggesting a critical role for Chinese horseshoe bats in the maintenance of SARS-CoVs.
FIG 3

Phylogenetic trees based on nucleotide sequences of the whole genome (A) and the S gene (B). The trees were constructed by the maximum likelihood method using the MrBayes approach employing the GTR+I+G nucleotide substitution model. The red letters represent the SARSr-CoV strains isolated from Guangdong in this study. GD, Guangdong; FJ, Fujian; JX, Jiangxi; HK, Hongkong; HB, Hubei; YN, Yunnan; GX, Guangxi; GZ, Guizhou; LN, Liaoning; HN, Hunan; ZJ, Zhejiang.

Phylogenetic trees based on nucleotide sequences of the whole genome (A) and the S gene (B). The trees were constructed by the maximum likelihood method using the MrBayes approach employing the GTR+I+G nucleotide substitution model. The red letters represent the SARSr-CoV strains isolated from Guangdong in this study. GD, Guangdong; FJ, Fujian; JX, Jiangxi; HK, Hongkong; HB, Hubei; YN, Yunnan; GX, Guangxi; GZ, Guizhou; LN, Liaoning; HN, Hunan; ZJ, Zhejiang. Phylogenetic trees based on nucleotide sequences of ORF1ab (A) and ORF8 (B). The trees were constructed by the maximum likelihood method using the MrBayes approach employing the GTR+I+G (A) and GTR+G (B) nucleotide substitution models. The red letters represent the SARSr-CoV strains isolated from bats from Guangdong. Download FIG S5, PDF file, 0.2 MB.

Cross-species transmission capacity of the bat SARSr-CoVs from Guangdong.

R. sinicus and R. affinis belong to the bat genus Rhinolophus and are regarded as the primary natural hosts of SARS-CoV and SARS-CoV-2, respectively. In the field survey, we found two interesting bat roost caves in Guangzhou. The relative positions of these two roost caves in the mountain were up and down, with about 5 m of vertical distance (Fig. S6). We conducted field surveys on the two caves from 2013 to 2021 and found that R. sinicus and R. affinis lived in the caves together except during hibernation periods, when they moved to other areas for hibernation in winter. However, in our study, R. sinicus and R. affinis bats in which SARSr-CoVs were detected did not display obvious signs of disease, but there was a 2.05% nucleotide sequence difference between the genomes of the two SARSr-CoVs from the same cave. Thus, the accumulation of variation could increase the risk of cross-species transmission. Bat forms and schematic diagram of the bat roosting cave. (A) Rhinolophus sinicus and R. affinis. (B) Schematic diagram of the bat roosting cave. Download FIG S6, PDF file, 1.0 MB. To date, reports confirm that the S protein is responsible for the entry of the virus and is functionally divided into two domains, S1 and S2, which are responsible for receptor binding and cellular membrane fusion, respectively. S1 consists of two domains, the N-terminal domain (NTD) and the C-terminal domain (CTD), which is also known as the receptor-binding domain (RBD) in SARS-CoV. The S proteins of SARSr-CoV-Rs56 and -RaCH025 showed 78.80–78.88% and 78.57–78.64% amino sequence identities to those of human and civet SARS-CoVs, and their RBD proteins were 58.92% and 59.46% similar to those of human and civet SARS-CoVs, respectively. In the receptor-binding motif (RBM) region, among the five key binding sites (residues 442, 472, 479, 487, and 491) with the receptor ACE2 (20), only residue 491 was conserved (Fig. 4). Bat-SARSr-CoV-Rs4874/Rs3367/WIVI strains from Yunnan have been verified by experiments to be able to use hACE2 as the entry receptor. Compared to them, bat SARSr-CoV strains from Guangdong failed to bind hACE2 because of the presence of two deletions (5 aa and 12-13 aa) in the key regions of RBMs, which was consistent with other bat SARSr-CoVs such as SARSr-CoV-YN2013, SARSr-CoV-Rs672, SARSr-CoV-GX2013, SARSr-CoV-HuB2013, SARSr-CoV-HKU3-1, SARSr-CoV-ZXC21, and SARSr-CoV-ZC45 (Fig. 4). Thus, the SARSr-CoVs carried by bats in Guangdong could spread within their own population, but they cannot directly infect humans.
FIG 4

Sequence alignment of the receptor-binding motif (RBM) region. The red font represents the sequences obtained in this study. Yellow highlighting represents the 5 key sites where SARS-CoV binds to angiotensin-converting enzyme 2 (ACE2). Green shading represents amino acid (aa) deletions. The red words represent the sequences obtained in this study.

Sequence alignment of the receptor-binding motif (RBM) region. The red font represents the sequences obtained in this study. Yellow highlighting represents the 5 key sites where SARS-CoV binds to angiotensin-converting enzyme 2 (ACE2). Green shading represents amino acid (aa) deletions. The red words represent the sequences obtained in this study. ACE2 had great genetic diversity in bats, and their receptor usage is species dependent, so the pathogenicity and host range of bat SARSr-CoVs remain to be determined in the future. The previous research of the interaction of the S protein between ACE2 and SARS-CoV/SARS-CoV-2 indicated that 23/22 sites on ACE2 can bind to the S protein receptor-binding region (21). We compared and analyzed the 24 amino acid sites, and the results showed that R. sinicus and R. affinis contain 6 or 8 sites with different amino acid residues compared to human ACE2 (Table 4), so we inferred that humans are susceptible to bat SARSr-CoVs. In this study, we also investigated the mRNA expression levels of ACE2 in eight organs of R. sinicus and R. affinis using quantitative real-time-PCR (qRT-PCR) and found that the mRNA expression levels of ACE2 were highest in the intestine; moreover, its expression levels were also high in the heart, kidney, and liver (Fig. 5). Among these organs, R. sinicus and R. affinis exhibited the lowest mRNA levels in the lung, followed by the spleen and brain. Our results suggested that all these organs are at risk of SARSr-CoV infection and that the intestine was the main channel for the spread of the virus.
TABLE 4

Sequence comparison among different ACE2s at the critical amino acid site that binds the SARS-CoV and SARS-CoV-2 receptor-binding domains

Note that black numbers indicate contact with SARS-CoV-2 S and SARS-CoV S, red numbers indicate contact with SARS-CoV-2 S only, and green numbers indicate contact with SARS-CoV S only (21). Amino acid differences compared to hACE2 are indicated as blue characters. The yellow shading represents the different sites between R. sinicus and R. affinis. Rs, R. sinicus; Ra, R. affinis.

FIG 5

mRNA expression of ACE2 in different organs of Rhinolophus sinicus and R. affinis.

mRNA expression of ACE2 in different organs of Rhinolophus sinicus and R. affinis. Sequence comparison among different ACE2s at the critical amino acid site that binds the SARS-CoV and SARS-CoV-2 receptor-binding domains Note that black numbers indicate contact with SARS-CoV-2 S and SARS-CoV S, red numbers indicate contact with SARS-CoV-2 S only, and green numbers indicate contact with SARS-CoV S only (21). Amino acid differences compared to hACE2 are indicated as blue characters. The yellow shading represents the different sites between R. sinicus and R. affinis. Rs, R. sinicus; Ra, R. affinis.

DISCUSSION

Here, we discovered two novel bat SARSr-CoVs, which were 2% to 7% different from those previously reported in Guangdong province (22). Through evolutionary and recombination analyses, these viruses from Guangdong might be recombined by viruses carried by bats from surrounding areas such as Hongkong, Guangxi and Hubei provinces. Guangdong and Guangxi are important hubs for the spread of coronavirus. The outbreak of SARS in Guangdong caused a severe pandemic since 2002. Evidence revealed that palm civets (Paguma larvata) may be the intermediate host. Although R. sinicus in Yunnan has been found to be the natural host of SARS, the viruses carried by bats can spread across species and mutate quickly. In addition, the illegal wildlife trade flourishes in Guangdong. Therefore, the long-term monitoring of the coronaviruses and even other viruses carried by bats in Guangdong is very important. In our study, the region with the largest difference between the sequences of the two novel SARSr-CoVs found in bats from Guangdong was ORF8, and evolutionary analysis of ORF8 suggested that the two viruses are located in two different clades. Previous studies have shown that ORF8 is the most hypervariable and could be divided into 3 types (type I ORF8, type II ORF8, and type III ORF8) in bat lineage B betacoronaviruses (23). According to currently available data, R. sinicus is the only bat species with SARSr-CoVs containing 2 different types of ORF8s (types I and III) and also the only bat species that harbors SARSr-CoVs with type I ORF8 rarely (23). In our study, ORF8 of 12 bat SARSr-CoV strains (including SARSr-CoV-Rs56) carried by R. sinicus from Guangdong, Jiangxi, and Fujian belonged to type I (see Fig. S5B in the supplemental material). They were the most nearly identical to those of 2 CoVs, SARSr-CoV-GX2013 and SARSr-CoV-YN2013. The results suggested that R. sinicus plays an important role in the maintenance of SARS-CoVs. The ORF8 region, unique to SARSr-CoVs, is prone to mutations or deletions during interspecies transmission (1, 24). Compared to ORF8 of bat SARSr-CoVs, ORF8 from human SARS-CoV is under strong positive selection during animal-to-human transmission (19). Gradual deletions in the ORF8 region of human SARS-CoVs found in the early phase, the middle phase, and the late phase of the epidemic of SARS did not apparently affect the survival of the virus (1, 24, 25). Thus, these viruses should be taken seriously and monitored to prevent the occurrence of an epidemic in the future. Many previous studies demonstrated the capacity of bat SARSr-CoVs (such as WIVI and Rs4874) to use ACE2 orthologs for cell internalization and efficient replication in human cells, which indicates that diverse variants of SARSr-CoV S proteins without deletions in their RBDs are able to use human ACE2 (13, 15). In contrast, some studies reveal that the S protein of the R. sinicus SARSr-CoV with deletions (Rp3) failed to use human, civet, and bat ACE2 for cell entry (26), so we inferred that the bat SARSr-CoVs from Guangdong with RBD deletions might not be able to use human ACE2 or directly infect humans. However, whether these viruses could infect humans by ACE2 needs further investigations. Although the bat SARSr-CoV strains from Guangdong had a low risk of infecting humans because of their inability to use hACE2 at present, we should not underestimate their cross-species abilities through recombination to obtain new S genes. In the detection of coronavirus carried by bat tissues, including heart, liver, spleen, lung, kidney, stomach, brain, anal swab, and intestine, we found that coronavirus presented only in anal swabs and intestines, indicating that this virus is transmitted through the digestive tract, in line with previous findings for SARS-CoV (27). The expression level of ACE2 in the intestines of R. sinicus was higher than those in other tissues, which might explain why the virus was detected in the intestine. In summary, the intestine of the bat is an important transmission route for the viruses carried by bats, which is also consistent with the research results of another study (28). The global pandemic of SARS-CoV-2 once again reminds us to always pay attention to mutations of viruses carried by wild animals, especially bats and other vector animals, so it is necessary to monitor the coronaviruses for the long term. In our study, we found two interesting roost caves of bats in Guangzhou with high rates of coronavirus positivity and recombination. Unfortunately, we did not collect samples before 2019, so more samples are needed for in-depth study in the future. It is well known that the full length of the viral sequence can better reflect its structure and variation information, but the full length of the virus is not easy to obtain due to the low virus content in the sample and the difficulty in culturing. We will continue to pay attention to bat caves with coronaviruses, collect as many samples as possible, and try to obtain more full-length sequences of coronaviruses to comprehensively analyze the variation pattern of coronaviruses. In conclusion, our study comprehensively analyzes the situation of coronaviruses carried by bats in Guangdong and elucidates the pattern of its transmission among bats. Meanwhile, we found that bat SARSr-CoV strains from Guangdong had a low risk of infecting humans because of their feature of being unable to use hACE2 until now, but we should not underestimate their cross-species abilities through recombination to obtain new S genes. Therefore, long-term monitoring is necessary to prevent public health incidents, and species protection provides an important reference basis.

MATERIALS AND METHODS

Bat sample collection and identification.

Samplings of bats were conducted from July 2008 to March 2021 in their natural roost caves in Guangdong (including Guangzhou, Huizhou, and Shenzhen) and Hainan, China (see Table S1 in the supplemental material). Bats were trapped, and anal swab samples were collected. Each sample was collected in 1 mL of viral transport medium (VTM), composed of Hanks’ balanced salt solution at pH 7.4 containing bovine serum albumin (BSA) (1%), amphotericin (15 μg/mL), and penicillin G (100 U/mL), and stored in a liquid nitrogen tank. Samples of 3 Rhinolophus sinicus and 3 R. affinis bats, including the heart, liver, spleen, lung, kidney, stomach, brain, and intestine, were used for quantitative real-time PCR (qRT-PCR) to determine the expression level of mRNA encoding ACE2. All samples were immediately placed into 2-mL polyethylene tubes (RNase free), stored in a liquid nitrogen tank, taken back to the laboratory, and stored at −80°C until nucleic acid extraction. Bats trapped for this study were released back into their habitat. All laboratory personnel were professionally trained and wore protective clothing to protect against biological agents before sample collection. Bat identification was initially determined in the field by morphology (Fig. S6A) and later confirmed in the laboratory by sequencing of the mitochondrial cytochrome b gene (cytb) from samples of anal swabs. PCR amplification of the cytb gene was performed using DNA extracted from the swabs of bats randomly. The primers for the cytb gene were cytb F (5′-ACAGGCTCAAACAACCCAAC-3′) and cytb R (5′-TGGCCTCCAATTCAGGTTAG-3′). A 25-μL reaction mixture was set up, containing 12.5 μL of PCR mix (Gentech, China), 10.5 μL of double-distilled water (ddH2O), 1 μL of the template, 0.5 μL of the forward primer (10 μM), and 0.5 μL of the reverse primer (10 μM). Thermal cycling was performed at 94°C for 3 min, followed by 35 cycles consisting of 94°C for 30 s, 55°C for 30 s, and 72°C for 1 min and a final extension step at 72°C for 10 min. PCR products were detected on a 1.5% agarose gel and sequenced.

Viral RNA extraction, PCR screening, and sequencing.

The anal swab samples were vortexed for 1 min, and 140 mL of the supernatant was collected from each sample after centrifugation at 3,000 rpm at 4°C for 1 min. Viral nucleic acid was extracted with a QIAamp viral RNA minikit (Qiagen, Germany) according to the manufacturer’s instructions. The tissue samples (50 mg of each sample) were ground into a powder in liquid nitrogen and transferred to a diethyl pyrocarbonate (DEPC)-treated eppendorf tubes before being volatilized by liquid nitrogen. One microgram of cDNA was synthesized using the SuperScript III first-strand synthesis system (Invitrogen, USA). RT-PCR was employed to detect the presence of coronavirus sequences using a 360-nt fragment from the S gene of bat CoV. The specific primers for the S gene were ZH2F and ZH2R (Table S2A). A 25-μL reaction mixture was set up, containing 12.5 μL of PCR mix (Gentech, China), 10.5 μL of ddH2O, 1 μL of the template, 0.5 μL of the forward primer (10 μM), and 0.5 μL of the reverse primer (10 μM). Thermal cycling was performed at 94°C for 3 min, followed by 35 cycles consisting of 94°C for 30 s, 55°C for 30 s, and 72°C for 1 min and a final extension step at 72°C for 10 min. PCR products were detected on a 1.5% agarose gel and sequenced.

Metatranscriptome detection.

A total of 50 to 1,000 ng of RNA of the positive samples (Rs56) was mixed with rRNA deletion probes for rRNA depletion. Fragmentation, synthesis of 1st and 2nd strands, end repair, and adaptor ligation were conducted for library construction as described previously (29). High-throughput sequencing was conducted by the Magigene Company (Guangzhou, China). Clean reads were de novo assembled using MEGAHIT version 1.0 (17). BWA version 0.7.17 (30) was used to align clean reads to assembled contigs. Contigs were then classified by BLASTx against the NT database using an alignment similarity of ≥80%, a length of matched areas of ≥500 nt, and an E value of ≤10−5. Contigs with significant BLASTx hits were confirmed as virus sequences.

Sequencing of full-length genomes and S genes.

The full genomic sequences of the SARSr-CoVs from positive samples (Rs56 and RaCH025) were determined by RT-PCR amplification with specific primers designed by multiple alignments of available SARS-CoV and bat SARSr-CoV sequences deposited in GenBank and additional primers from the ARTIC Network for amplifying the SARS-CoV-2 genome (https://github.com/artic-network/artic-ncov2019/blob/master/primer_schemes/nCoV-2019/V3/nCoV-2019.tsv) (Table S2A and B). The complete S genes of bat SARSr-CoVs from positive samples, with adequate amounts of RNA available, were sequenced by using primers targeted to the S genes (Table S2C). PCR products of the expected size were gel purified and subjected directly to sequencing. The sequence identity between the whole genome and different genes or regions was calculated by utilizing p-distance in MEGA X (31).

Evolution analysis.

We downloaded 97 full-length genome sequences of betacoronaviruses isolated from bats from the NCBI database (https://www.ncbi.nlm.nih.gov/) (Table S3). Phylogenetic analyses were performed based on their whole-genome sequences and their ORF1ab gene, S gene, and ORF8 gene sequences. We constructed multiple-sequence alignments of their complete genomes and individual genes using MAFFT v7.407 (32). Phylogenetic analyses were estimated using MrBayes (33) with 500,000 generations and 25% of the generations as the burn-in. The best models were determined by jModelTest v2.1.7 (34). Next, the trees were visualized and exported as vector diagrams with FigTree v1.4.4 (http://tree.bio.ed.ac.uk/software/Figtree/). The aligned sequences were scanned for recombination events by the Recombination Detection Program (RDP) (35). The potential recombination events suggested by strong P values (<10−20) were further confirmed using similarity plot and bootscan analyses implemented in Simplot v3.5.1.

Quantitative real-time PCR.

qRT-PCR was used to determine the mRNA expression levels of ACE2 in different tissues of Malayan pangolins. The total volume of the qRT-PCR mixture was 20 μL; this included 10 μL of a 2× qPCR mixture (TaKaRa, Japan), 0.5 μL of forward and reverse primers, 1 μL of the template, and remaining volume as nuclease-free water. The following procedure was used for amplification: 95°C for 30 s and 40 cycles of 95°C for 5 s, 60°C for 34 s, and 95°C for 15 s. β-Actin was used as the internal control. The primers for ACE2 of R. sinicus were RsACE2-F (5′-TTGCGTATGCCATGAGAGAG-3′), RsACE2-R (5′-ACAGATTTCCGGGTGAAGTG-3′), Rsβ-actin-F (5′-CTGGACTTTGAGCAGGAGATG-3′), and Rsβ-actin-R (5′-ATGGAGTTGAATGTGGTCTCG-3′). The primers for ACE2 of R. affinis were RaACE2-F (5′-TTCCCAAAGAGGAGTGGATG-3′), RaACE2-R (5′-ACAGGGCTTCATGAAACTGG-3′), Raβ-actin-F (5′-CTGGACTTTGAGCAGGAGATG-3′), and Raβ-actin-R (5′-GTTGAATGTGGTCTCGTGGAT-3′). The expression level of mRNA encoding ACE2 was separately determined using the comparative threshold cycle (C) (2−ΔΔ) method. Means and standard deviations (SD) were calculated from biological replicates. Statistical significance was measured using the independent-sample t test by SPSS 17.0.

Ethics statement.

The animal study was reviewed and approved by the Committee on the Ethics of Animal Experiments of the Institute of Zoology of the Guangdong Academy of Sciences.

Data availability.

The data supporting this study are openly available at the NCBI Sequence Read Archive (SRA) and GenBank as follows: 28 sequences obtained from positive samples by RT-PCR have been deposited in GenBank under accession numbers OM263332-OM263359; S gene sequences obtained from bats (Rs56, Rs67, Rs68, Rs87, RaCH025, RaCH027, RaCH039 and Rs200609) have been deposited in GenBank under accession numbers OM222057-OM222064; metagenomics data obtained from anal swab from bat Rs56 have been deposited in SRR13787015 under BioProject accession number PRJNA705004; whole-genome sequences obtained from Rs56 and RaCH025 have been deposited in GenBank under accession numbers MW681002 and OM240725, respectively.
  34 in total

1.  MRBAYES: Bayesian inference of phylogenetic trees.

Authors:  J P Huelsenbeck; F Ronquist
Journal:  Bioinformatics       Date:  2001-08       Impact factor: 6.937

Review 2.  MEGAHIT v1.0: A fast and scalable metagenome assembler driven by advanced methodologies and community practices.

Authors:  Dinghua Li; Ruibang Luo; Chi-Man Liu; Chi-Ming Leung; Hing-Fung Ting; Kunihiko Sadakane; Hiroshi Yamashita; Tak-Wah Lam
Journal:  Methods       Date:  2016-03-21       Impact factor: 3.608

3.  Isolation and Characterization of a Novel Bat Coronavirus Closely Related to the Direct Progenitor of Severe Acute Respiratory Syndrome Coronavirus.

Authors:  Xing-Lou Yang; Ben Hu; Bo Wang; Mei-Niang Wang; Qian Zhang; Wei Zhang; Li-Jun Wu; Xing-Yi Ge; Yun-Zhi Zhang; Peter Daszak; Lin-Fa Wang; Zheng-Li Shi
Journal:  J Virol       Date:  2015-12-30       Impact factor: 5.103

4.  Evolutionary origins of the SARS-CoV-2 sarbecovirus lineage responsible for the COVID-19 pandemic.

Authors:  Maciej F Boni; Philippe Lemey; Xiaowei Jiang; Tommy Tsan-Yuk Lam; Blair W Perry; Todd A Castoe; Andrew Rambaut; David L Robertson
Journal:  Nat Microbiol       Date:  2020-07-28       Impact factor: 17.745

5.  Isolation and characterization of a bat SARS-like coronavirus that uses the ACE2 receptor.

Authors:  Xing-Yi Ge; Jia-Lu Li; Xing-Lou Yang; Aleksei A Chmura; Guangjian Zhu; Jonathan H Epstein; Jonna K Mazet; Ben Hu; Wei Zhang; Cheng Peng; Yu-Ji Zhang; Chu-Ming Luo; Bing Tan; Ning Wang; Yan Zhu; Gary Crameri; Shu-Yi Zhang; Lin-Fa Wang; Peter Daszak; Zheng-Li Shi
Journal:  Nature       Date:  2013-10-30       Impact factor: 49.962

Review 6.  Bat Coronaviruses in China.

Authors:  Yi Fan; Kai Zhao; Zheng-Li Shi; Peng Zhou
Journal:  Viruses       Date:  2019-03-02       Impact factor: 5.048

7.  ORF8-Related Genetic Evidence for Chinese Horseshoe Bats as the Source of Human Severe Acute Respiratory Syndrome Coronavirus.

Authors:  Zhiqiang Wu; Li Yang; Xianwen Ren; Junpeng Zhang; Fan Yang; Shuyi Zhang; Qi Jin
Journal:  J Infect Dis       Date:  2015-10-03       Impact factor: 5.226

8.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

9.  Deciphering the bat virome catalog to better understand the ecological diversity of bat viruses and the bat origin of emerging infectious diseases.

Authors:  Zhiqiang Wu; Li Yang; Xianwen Ren; Guimei He; Junpeng Zhang; Jian Yang; Zhaohui Qian; Jie Dong; Lilian Sun; Yafang Zhu; Jiang Du; Fan Yang; Shuyi Zhang; Qi Jin
Journal:  ISME J       Date:  2015-08-11       Impact factor: 10.302

10.  Fatal swine acute diarrhoea syndrome caused by an HKU2-related coronavirus of bat origin.

Authors:  Peng Zhou; Hang Fan; Tian Lan; Xing-Lou Yang; Wei-Feng Shi; Wei Zhang; Yan Zhu; Ya-Wei Zhang; Qing-Mei Xie; Shailendra Mani; Xiao-Shuang Zheng; Bei Li; Jin-Man Li; Hua Guo; Guang-Qian Pei; Xiao-Ping An; Jun-Wei Chen; Ling Zhou; Kai-Jie Mai; Zi-Xian Wu; Di Li; Danielle E Anderson; Li-Biao Zhang; Shi-Yue Li; Zhi-Qiang Mi; Tong-Tong He; Feng Cong; Peng-Ju Guo; Ren Huang; Yun Luo; Xiang-Ling Liu; Jing Chen; Yong Huang; Qiang Sun; Xiang-Li-Lan Zhang; Yuan-Yuan Wang; Shao-Zhen Xing; Yan-Shan Chen; Yuan Sun; Juan Li; Peter Daszak; Lin-Fa Wang; Zheng-Li Shi; Yi-Gang Tong; Jing-Yun Ma
Journal:  Nature       Date:  2018-04-04       Impact factor: 49.962

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.