| Literature DB >> 31067830 |
Susanna K P Lau1,2,3,4, Antonio C P Wong5, Libao Zhang6, Hayes K H Luk7, Jamie S L Kwok8, Syed S Ahmed9, Jian-Piao Cai10, Pyrear S H Zhao11, Jade L L Teng12,13,14, Stephen K W Tsui15, Kwok-Yung Yuen16,17,18,19, Patrick C Y Woo20,21,22,23.
Abstract
While bats are increasingly recognized as a source of coronavirus epidemics, the diversity and emergence potential of bat coronaviruses remains to be fully understood. Among 1779 bat samples collected in China, diverse coronaviruses were detected in 32 samples from five different bat species by RT-PCR. Two novel alphacoronaviruses, Rhinolophus sinicus bat coronavirus HKU32 (Rs-BatCoV HKU32) and Tylonycteris robustula bat coronavirus HKU33 (Tr-BatCoV HKU33), were discovered from Chinese horseshoe bats in Hong Kong and greater bamboo bats in Guizhou Province, respectively. Genome analyses showed that Rs-BatCoV HKU32 is closely related to BatCoV HKU10 and related viruses from diverse bat families, whereas Tr-BatCoV HKU33 is closely related to BtNv-AlphaCoV and similar viruses exclusively from bats of Vespertilionidae family. The close relatedness of Rs-BatCoV HKU32 to BatCoV HKU10 which was also detected in Pomona roundleaf bats from the same country park suggests that these viruses may have the tendency of infecting genetically distant bat populations of close geographical proximity with subsequent genetic divergence. Moreover, the presence of SARSr-CoV ORF7a-like protein in Rs-BatCoV HKU32 suggests a common evolutionary origin of this accessory protein with SARS-CoV, also from Chinese horseshoe bats, an apparent reservoir for coronavirus epidemics. The emergence potential of Rs-BatCoV HKU32 should be explored.Entities:
Keywords: Alphacoronavirus; bat; coronavirus; discovery; epidemics; molecular; novel; species
Mesh:
Year: 2019 PMID: 31067830 PMCID: PMC6563315 DOI: 10.3390/v11050423
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Figure 1Map of southern China showing locations where bat coronaviruses were found. Black bat represents the location with bats positive for Hi-BatCoV HKU10; orange bat represents the location with bats positive for coronavirus (CoV) PREDICT CoV-37; blue bat represents the location with bats positive for Rs-BatCoV HKU32; green bat represents the location with bats positive for Tr-BatCoV HKU33; grey bat represents the location with bats positive for severe acute respiratory syndrome related (SARSr) BatCoV; yellow bat represents the location with bats positive for Ty-BatCoV HKU4. Provinces where samples were collected are in red font.
Detection of CoVs in different bat species by reverse transcription (RT)-polymerase chain reaction (PCR) of the 440-bp fragment of RNA-dependent RNA polymerase (RdRp) gene.
| Scientific Name | Common Name | No. of Bats Captured | No. of Bats Positive for CoV / (%) | CoV Detected | Sampling Location of Bats |
|---|---|---|---|---|---|
|
| Greater short-nosed fruit bat | 3 | 0 | - | SWH |
|
| Great roundleaf bat | 3 | 0 | - | GZ |
|
| Intermediate roundleaf bat | 21 | 0 | - | GZ |
|
| Pomona leaf-nosed bat | 182 | 2 / (1.1) | Hi-BatCoV HKU10 | TLC13, GD |
|
| Chinese pipistrelle | 2 | 0 | - | LMHP |
|
| Western bent-winged bat | 1 | 0 | - | SK01 |
|
| Small bent-wing bat | 56 | 0 | - | LMH, SWH, SK01 |
|
| Common bent-wing bat | 23 | 0 | - | SK01 |
|
| Eastern bent-wing bat | 1 | 0 | - | LMHP |
|
| Large myotis | 10 | 0 | - | SK01, GZ |
|
| Rickett’s big-footed bat | 93 | 1 / (1.1) | Coronavirus PREDICT CoV-37 | LMH01, SK01 |
|
| Common noctule | 1 | 0 | - | YSO |
|
| Japanese pipistrelle | 6 | 0 | - | MPO, YSO, KKSH |
|
| Least pipistrelle | 4 | 0 | - | KKSH, YSO, SWH, LMHP |
|
| Intermediate horseshoe bat | 76 | 0 | - | TLC01, TLC13, SK01 |
|
| Pearson’s horseshoe bat | 2 | 0 | - | GDP |
|
| Least horseshoe bat | 17 | 0 | - | TLC13 |
|
| Chinese horseshoe bat | 272 | 10 / (3.7) | Rs-BatCoV HKU32 (7) | TLC01, GDP |
|
| Lesser bamboo bat | 240 | 18 / (7.5) | Ty-BatCoV HKU4 | WKT, PFL, SWH, TLC01, GZP |
|
| Greater bamboo bat | 104 | 1 / (0.96) | Tr-BatCoV HKU33 | GZP |
GD, Guangdong Province; GDP, Guangdong Province—Conghua City; GZ, Guizhou Province; GZP, Guizhou Province—Luodian County; KKSH, Kai Kuk Shue Ha, Luk Keng; LMH, Lin Ma Hang Lead Mine; LMHP, Lin Ma Hang Pool; MPO, Mai Po Nature Reserve; PFL, Pok Fu Lam; SK01, Sai Kung; SWH, Sheung Wo Hang, Sha Tau Kok; TLC01, Tai Lam-Shek Kong; TLC13, Tai Lam-Shek Kong; WKT, Wu Kau Tang; YSO, Yung Shue O Stream, Sai Kun.
Coding potential and predicted domains in different proteins of Rs-BatCoV HKU32 strain TLC28A.
| Putative TRS | ||||||
|---|---|---|---|---|---|---|
| ORF | Nucleotide Positions (Start–End) | No. of Nucleotides | No. of Amino Acids | Frame(s) | Nucleotide Position in Genome | TRS Sequence (Distance (No. of Bases) to AUG) 1 |
| 1ab | 291–20,428 | 20,137 | 6712 | +2, +3 | 69 | AA |
| nsp1 | 291–875 | 585 | 195 | +3 | ||
| nsp2 | 876–2963 | 2088 | 696 | +3 | ||
| nsp3 | 2964–7649 | 4686 | 1562 | +3 | ||
| nps4 | 7650–9083 | 1434 | 478 | +3 | ||
| nsp5 | 9084–9989 | 906 | 302 | +3 | ||
| nsp6 | 9990–10817 | 828 | 276 | +3 | ||
| nsp7 | 10,818–11,066 | 249 | 83 | +3 | ||
| nsp8 | 11,067–11,651 | 585 | 195 | +3 | ||
| nsp9 | 11,652–11,975 | 324 | 108 | +3 | ||
| nsp10 | 11,976–12,383 | 408 | 136 | +3 | ||
| nsp11 | 51 | 17 | +3 | |||
| nsp12 | 12,384–15,163 | 2780 | 927 | +2 | ||
| nsp13 | 15,164–16,954 | 1791 | 597 | +2 | ||
| nsp14 | 16,955–18,508 | 1554 | 518 | +2 | ||
| nsp15 | 18,509–19,525 | 1017 | 339 | +2 | ||
| nsp16 | 19,526–20,428 | 903 | 300 | +2 | ||
| S | 20,430–24,485 | 4056 | 1351 | +2 | 20,421 | AA |
| ORF3 | 24,485–25,153 | 669 | 222 | +3 | 24,279 | TC |
| ORF4 | 25,184–25,543 | 360 | 119 | +2 | ||
| ORF5a | 25,544–25,888 | 345 | 114 | +2 | 25,540 | GA |
| ORF5b | 25,782–26,225 | 444 | 147 | +3 | ||
| E | 26,209–26,433 | 225 | 74 | +1 | 26,140 | AA |
| M | 26,440–27,126 | 687 | 228 | +1 | 26,430 | GT |
| N | 27,137–28,279 | 1143 | 380 | +2 | 27,128 | AA |
| ORF9 | 28,251–28,568 | 318 | 105 | +3 | 28,187 | AG |
| ORF10 (SARS-CoV ORF7a-like protein) | 28,593–28,955 | 363 | 120 | +3 | 28,284 | AA |
1 TRS sequences are shown in bold.
Figure 2Genome organizations of Rs-BatCoV HKU32 strain TLC28A and Tr-BatCoV HKU33 strain GZ151867. Genes for ORF 1a and 1b of Rs-BatCoV HKU32 strain TLC28A and Tr-BatCoV HKU33 strain GZ151867 are represented by red and orange boxes, respectively. Genes for spike protein (S), envelope protein (E), membrane protein (M) and nucleocapsid protein (N) are represented by blue boxes. Genes for putative accessory proteins are represented by yellow boxes.
Characteristics of putative nonstructural proteins of open reading frame (ORF)1ab in Rs-BatCoV HKU32 strain TLC28A, Tr-BatCoV HKU33 strain GZ151867, BatCoV HKU10 and BtNv-AlphaCoV/SC2013.
| Amino Acids | |||||
|---|---|---|---|---|---|
| nsp | Putative Function or Domain | Rs-BatCoV HKU32 | Ro-BatCoV HKU10 183A | Tr-BatCoV HKU33 | BtNv-AlphaCoV/SC2013 |
| nsp1 | Unknown | M1 – A195 | M1 – A 195 | M1 – A193 | M1 – A193 |
| nsp2 | Unknown | P196 – G891 | K196 – G888 | K194 – G771 | K194 – G771 |
| nsp3 | ADRP, Putative PLpro | G892 – G2453 | S889 – G2518 | G772 – G2339 | G772 – G2338 |
| nsp4 | Hydrophobic domain | S2454 – Q2931 | S2519 – Q 2996 | G2340 – Q2817 | G2339 – Q2815 |
| nsp5 | 3CLpro | S2932 – Q3233 | S2997 – Q3298 | S2818 – Q3119 | A2816 – Q3117 |
| nsp6 | Hydrophobic domain | S3234 – Q3509 | S3299 – Q3574 | G3120 – Q3398 | S3118 – Q3395 |
| nsp7 | Unknown | S3510 – Q3592 | S3575 – Q3657 | S3399 – Q3481 | S3396 – Q3478 |
| nsp8 | Unknown | S3593 – Q3787 | S3658 – Q3852 | S3482 – Q3676 | S3479 – Q3673 |
| nsp9 | Unknown | N3788 – Q3895 | N3853 – Q3960 | N3677 – Q3784 | N3674 – Q3781 |
| nsp10 | Unknown | A3896 – Q4031 | A3961 – Q4097 | A3785 – Q3919 | A3782 –Q3916 |
| nsp11 | Unknown | S4032 – D4048 | A4098 – Q4115 | T3920 – D3936 | A3917 – D3933 |
| nsp12 | RdRp | S4032 – Q4958 | A4098 – Q5024 | T3920 – Q4846 | A3917 – Q4843 |
| nsp13 | Hel | A4959 – Q5555 | S5025 – Q5621 | S4847 – Q5443 | S4844 – Q5440 |
| nsp14 | ExoN, N7-MTase | S5556 – Q6073 | A5622 – Q6139 | A5444 – Q5960 | S5441 – Q5958 |
| nsp15 | NendoU | G6074 – Q6412 | S6140 – Q6478 | S5961 – Q6299 | G5959 – Q6297 |
| nsp16 | O-MT | A6413 – K6712 | S6479 – R6780 | S6300 – Y6591 | S6298 – Y6589 |
Cleavage site used between nsp in Rs-BatCoV HKU32 strain TLC28A, Tr-BatCoV HKU33 strain GZ151867, BatCoV HKU10 and BtNv-AlphaCoV/SC2013.
| nsp | Cleavage Site | |||
|---|---|---|---|---|
| Rs-BatCoV HKU32 Strain TLC28A | Ro-BatCoV HKU10 183A | Tr-BatCoV HKU33 Strain GZ151867 | BtNv-AlphaCoV/SC2013 | |
| nsp1/nsp2 | A/P | A/K | A/K | A/K |
| nsp2/nsp3 | G/G | G/S | G/G | G/G |
| nsp3/nsp4 | G/S | G/S | G/G | G/G |
| nsp4/nsp5 | Q/S | Q/S | Q/S | Q/A |
| nsp5/nsp6 | Q/S | Q/S | Q/G | Q/S |
| nsp6/nsp7 | Q/S | Q/S | Q/S | Q/S |
| nsp7/nsp8 | Q/S | Q/S | Q/S | Q/S |
| nsp8/nsp9 | Q/N | Q/N | Q/N | Q/N |
| nsp9/nsp10 | Q/A | Q/A | Q/A | Q/A |
| nsp10/nsp12 | Q/S | Q/A | Q/T | Q/A |
| nsp12/nsp13 | Q/A | Q/S | Q/S | Q/S |
| nsp13/nsp14 | Q/S | Q/A | Q/A | Q/S |
| nsp14/nsp15 | Q/G | Q/S | Q/S | Q/G |
| nsp15/nsp16 | Q/A | Q/S | Q/S | Q/S |
Pairwise comparison of Coronaviridae-wide conserved domains in replicase polyprotein 1ab and overall replicase polyprotein 1ab between Rs-BatCoV HKU32 strain TLC28A and other alphaCoVs.
| Replicase | Pairwise Sequence Identity with Rs-BatCoV HKU32 Strain TLC28A (%) | ||||
|---|---|---|---|---|---|
| BtRf-AlphaCoV/ | BtMs-AlphaCoV/ | Ro-BatCoV HKU10 | PEDV | Tr-BatCoV HKU33 strain GZ151867 | |
| nsp3 | 67.6 | 67.6 | 60.3 | 50.1 | 49.0 |
| nsp5 | 84.8 | 84.8 | 81.5 | 74.2 | 75.2 |
| nsp12 | 92.6 | 92.6 | 90.1 | 83.2 | 83.7 |
| nsp13 | 94.1 | 94.3 | 92.1 | 85.6 | 80.9 |
| nsp14 | 93.4 | 93.4 | 90.0 | 79.9 | 78.8 |
| nsp15 | 89.4 | 89.4 | 83.8 | 76.7 | 78.2 |
| nsp16 | 89.7 | 90.0 | 85.8 | 82.8 | 81.2 |
| 7 Concatenated Domains | 83.2 | 83.3 | 78.6 | 70.3 | 69.5 |
| Overall replicase pp1ab | 80.1 | 80.5 | 75.0 | 65.8 | 63.3 |
Coding potential and predicted domains in different proteins of Tr-BatCoV HKU33 strain GZ151867.
| Putative TRS | ||||||
|---|---|---|---|---|---|---|
| ORF | Nucleotide Positions (Start–End) | No. of Nucleotides | No. of Amino Acids | Frame(s) | Nucleotide Position in Genome | TRS Sequence (Distance (No. of Bases) to AUG) 1 |
| 1ab | 278–20,052 | 19,774 | 6591 | +1, +2 | 54 | AA |
| nsp1 | 278–856 | 579 | 193 | +2 | ||
| nsp2 | 857–2590 | 1734 | 578 | +2 | ||
| nsp3 | 2591–7294 | 4704 | 1568 | +2 | ||
| nps4 | 7295–8728 | 1434 | 478 | +2 | ||
| nsp5 | 8729–9634 | 906 | 302 | +2 | ||
| nsp6 | 9635–10,471 | 837 | 279 | +2 | ||
| nsp7 | 10,472–10,532 | 249 | 83 | +2 | ||
| nsp8 | 10,533–11,305 | 585 | 195 | +2 | ||
| nsp9 | 11,306–11,629 | 324 | 108 | +2 | ||
| nsp10 | 11,630–12,034 | 405 | 135 | +2 | ||
| nsp11 | 51 | 17 | +2 | |||
| nsp12 | 12,035–14,814 | 2780 | 927 | +1 | ||
| nsp13 | 14,815–16,605 | 1791 | 597 | +1 | ||
| nsp14 | 16,606–18,156 | 1551 | 517 | +1 | ||
| nsp15 | 18,157–19,173 | 1017 | 339 | +1 | ||
| nsp16 | 19,174–20,052 | 879 | 292 | +1 | ||
| S | 20,053–24,150 | 4098 | 1365 | +1 | 20,049 | GA |
| ORF3 | 24,150–24,755 | 606 | 201 | +3 | 23,876 | AT |
| E | 24,777–25,004 | 228 | 75 | +3 | 24,763 | TT |
| M | 25,011–25,697 | 687 | 228 | +3 | 25,001 | GT |
| N | 25,706–26,977 | 1272 | 423 | +2 | 25,699 | AA |
| ORF7 | 26,989–27,348 | 360 | 119 | +1 | 26,982 | AA |
1 TRS sequences are shown in bold.
Pairwise comparison of Coronaviridae-wide conserved domains in replicase polyprotein 1ab and overall replicase polyprotein 1ab between Tr-BatCoV HKU33 strain GZ151867 and other alphaCoVs.
| Replicase Polyprotein Domain | Pairwise Amino Acid Sequence Identity with Tr-BatCoV HKU33 Strain GZ151867 (%) | ||||
|---|---|---|---|---|---|
| BtNv-AlphaCoV/SC2013 | BtRf-AlphaCoV/HuB2013 | BtMs-AlphaCoV/GS2013 | AlphaCoV BatCoV/P.kuhlii/Italy | Rs-BatCoV HKU32 Strain TLC28A | |
| nsp3 | 58.8 | 48.7 | 48.8 | 57.3 | 49.0 |
| nsp5 | 82.5 | 74.2 | 74.5 | 78.8 | 75.2 |
| nsp12 | 86.3 | 83.5 | 83.6 | 86.4 | 83.7 |
| nsp13 | 87.0 | 79.7 | 80.1 | 82.7 | 80.9 |
| nsp14 | 84.0 | 79.3 | 79.3 | 82.4 | 78.8 |
| nsp15 | 85.8 | 79.6 | 79.0 | 84.7 | 78.2 |
| nsp16 | 87.3 | 80.8 | 80.5 | 84.2 | 81.2 |
| 7 Concatenated Domains | 76.3 | 69.2 | 69.1 | 74.4 | 69.4 |
| Overall replicase pp1ab | 73.4 | 62.9 | 63.0 | 71.6 | 63.3 |
Figure 3Phylogenetic analysis of ORF1ab amino acid sequences of Rs-BatCoV HKU32 strains TLC26A and 28A, Tr-BatCoV HKU33 strain GZ151867 and other alphaCoVs. ORF1ab tree was constructed by maximum likelihood method using LG + G + I + F substitution model. The bootstrap values are calculated from 1000 trees. Tree was rooted using corresponding sequence of Middle East respiratory syndrome (MERS)-CoV (GenBank accession number YP_009047202.1). All bootstrap values are shown. The scale bar represents 5 substitutions per site. Both Rs-BatCoV HKU32 and Tr-BatCoV HKU33 are labeled with red (triangle) and blue (circle), respectively. Corresponding viral bat hosts’ families are highlighted in different colors: Red, Vespertilionidae; blue, Miniopteridae; yellow, Rhinolophidae; purple, Hipposideridae; green, Pteropodidae.
Figure 4Phylogenetic analysis of S amino acid sequences of Rs-BatCoV HKU32 strains TLC26A and 28A, Tr-BatCoV HKU33 strain GZ151867 and other alphaCoVs. S tree was constructed by maximum likelihood method using WAG + G + I + F substitution model. The bootstrap values are calculated from 1000 trees. Tree was rooted using corresponding sequence of MERS-CoV (GenBank accession number YP_009047204.1). All bootstrap values are shown. The scale bar represents 2 substitutions per site. Both Rs-BatCoV HKU32 and Tr-BatCoV HKU33 are highlighted in red (triangle) and blue (circle), respectively. Corresponding viral hosts are shown on the right.
Figure 5Rs-BatCoV HKU32 strain TLC28A mRNA leader-body junction and flanking sequences. The subgenomic ORF10 mRNA sequence are shown in alignment with the leader and genomic sequences. Identical nucleotides between the leader sequence and subgenomic mRNA sequence are labeled in green. Identical nucleotides between genome and subgenomic mRNA sequence are labeled in blue. The putative TRS is labeled in bold type, in purple and is underlined. Start codon AUG is labeled in red.