| Literature DB >> 29258555 |
Ana Moreno1, Davide Lelli2, Luca de Sabato3,4, Guendalina Zaccaria3, Arianna Boni3, Enrica Sozzi2, Alice Prosperi2, Antonio Lavazza2, Eleonora Cella5, Maria Rita Castrucci3, Massimo Ciccozzi5, Gabriele Vaccari3.
Abstract
BACKGROUND: Middle East respiratory syndrome coronavirus (MERS-CoV), which belongs to beta group of coronavirus, can infect multiple host species and causes severe diseases in humans. Multiple surveillance and phylogenetic studies suggest a bat origin. In this study, we describe the detection and full genome characterization of two CoVs closely related to MERS-CoV from two Italian bats, Pipistrellus kuhlii and Hypsugo savii.Entities:
Keywords: Bats; Full genome sequencing; Italy; MERS-like Beta-CoV viruses; Phylogenetic and molecular analyses
Mesh:
Substances:
Year: 2017 PMID: 29258555 PMCID: PMC5735805 DOI: 10.1186/s12985-017-0907-1
Source DB: PubMed Journal: Virol J ISSN: 1743-422X Impact factor: 4.099
Genome localization of predicted protein sequences, putative leader TRS-L and TRS-B
| BatCoV-Ita1 | |||
| ORF | nt position (start-end) | No. of amino acids | Sequencea |
| ORF1ab (TRS-L) | 217–21,446 | 7076 | 00036GATTTTAACGAACTTAAA00053 |
| Spike | 21,388–25,425 | 1345 | 21330C.AG..........CGTT21347 |
| ORF3 | 25,438–25,749 | 103 | 25419TCAC.A.....T.....T25436 |
| ORF4a | 25,758–26,045 | 95 | 25742A.AA..........CT.T25759 |
| ORF4b | 25,963–26,724 | 253 | |
| ORF5 | 26,731–27,414 | 227 |
26,
|
| E | 27,493–27,741 | 82 | 27479TTGGAA........ATGT27496 |
| M | 27,756–28,412 | 218 | 27,734.GG...........CTCT27751 |
| N | 28,460–29,749 | 429 | 28,430............TC.TT.28447 |
| ORF8b | 28,506–29,084 | 192 | |
| BatCoV-Ita2 | |||
| ORF | nt position (start-end) | No. of amino acids | Sequencea |
| ORF1ab (TRS-L) | 208–21,437 | 7076 | 00026GATTTTAACGAACTTAAA00043 |
| Spike | 21,379–25,416 | 1345 | 21321C.AG..........CGTT21338 |
| ORF3 | 25,429–25,740 | 103 | 25410TCAC.A.....T.....T25427 |
| ORF4a | 25,749–26,036 | 95 | 25733A.AA..........CT.T25750 |
| ORF4b | 25,954–26,715 | 253 | |
| ORF5 | 26,722–27,405 | 227 |
26,
|
| E | 27,484–27,732 | 82 | 27470TTGGAA........ATGT27487 |
| M | 27,747–28,403 | 218 | 27,725.GG...........CTCT27742 |
| N | 28,451–29,740 | 429 | 28,421............TC.TT.28438 |
| ORF8b | 28,497–29,075 | 192 | |
aDots represent identical nucleotides compared to the TRS-L
Prediction of the putative pp1a/pp1b cleavage sites of BatCoV-Ita1/2 based on sequence comparison with MERS-CoV strain HCoV-EMC/2012
| NSP | Position of the putative cleavage sitesa | Protein size (no. of amino acids) | Putative functional domain(s)b |
|---|---|---|---|
| NSP1 | Met1-Gly195 | 195 | |
| NSP2 | Asn196-Gly855 | 660 | |
| NSP3 | Ala856-Gly2738 | 1883 | ADRP, PL2pro |
| NSP4 | Ala2739-Gln3245 | 507 | |
| NSP5 | Ser3246-Gln3551 | 306 | 3CLpro |
| NSP6 | Ser3552-Gln3843 | 292 | |
| NSP7 | Ser3844-Gln3926 | 83 | |
| NSP8 | Ala3927-Gln4125 | 199 | Primase |
| NSP9 | Asn4126-Gln4235 | 110 | |
| NSP10 | Ala4236-Gln4375 | 140 | |
| NSP11 | Ser4376-Ile4389 | 14 | Short peptide at the end of |
| NSP12 | Ser4376-Gln5309 | 934 | RdRp |
| NSP13 | Ala5310-Gln5907 | 598 | HEL, NTPase |
| NSP14 | Ser5908-Gln6431 | 524 | ExoN, NMT |
| NSP15 | Gly6432-Gln6773 | 342 | NendoU |
| NSP16 | Ala6774-His7076 | 303 | OMT |
aSuperscript numbers indicate positions in polyprotein pp1a/pp1ab or position in available sequence with the supposition of a ribosomal frameshift based on the conserved slippery sequenced (UUUAAAC) of Coronaviruses. Localized at nucleotide position 13,359–13,365 for BatCoV-Ita1 and 13,350–13,356 for BatCoV-Ita2
b ADRP ADP-ribose 1-phosphatase, PL2pro papain-like protease 2, 3CLpro coronavirus NSP5 protease, Hel helicase, NTPase nucleoside triphosphatase, ExoN exoribonuclease, NMT N7 methyltransferase, NendoU endoribonuclease, OMT 2’ O-methyltransferase
Comparison of the predicted pp1a/pp1b cleavage site sequencesa of BatCoV-Ita1/2 with prototype clade c betacoronaviruses and MERS related strains
| NSP1 | NSP2 | NSP3 | NSP4 | NSP5 | NSP6 | NSP7 | NSP8 | NSP9 | NSP10 | NSP11 | NSP12 | NSP13 | NSP14 | NSP15 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| BatCoV-Ita1/2 | LVGG | LKGG | IVGG | LQS | MQS | VQS | LQA | LQN | LQA | TQS | RGSI | LQA | LQS | VQG | LQA |
| MERSb | -I-- | – | – | – | – | M-- | – | – | – | P-- | – | – | – | – | – |
| HKU4c | -I-- | – | – | – | – | – | – | – | – | P-- | GS-V | – | – | – | – |
| HKU5d | – | – | LS-- | – | – | – | – | – | – | P-- | – | – | – | I-- | – |
| Erinaceuse | -C-- | – | – | – | – | – | --S | – | – | LH- | – | – | – | – | – |
| NeoCoVf | -T-- | – | – | – | – | I-- | – | – | – | P-- | – | – | – | – | – |
| BtVs-BetaCoV/SC2013g | – | – | – | – | – | – | – | – | – | P-- | – | – | – | – | – |
aHyphens represent identical amino acids compared to the BatCoV-Ita1/2 sequences
bGenBank accession number JX869059
cGenBank accession number EF065505
dGenBank accession number EF065509
eGenBank accession number KC545386
fGenBank accession number KC869678
gGenBank accession number KJ473821
Fig. 1Sequence identity between IT bat CoVs and other prototype clade c betacoronaviruses and MERS related strains. Similarity plots were generated using SSE version 1.2 using a sliding window of 600 and a step size of 100 nucleotides
Fig. 2Maximum phylogenetic tree based on alpha and beta CoV full genomes. Ultrafast bootstrap approximation approach was performed to compute the support of phylogenetic groups. Best-fit model according to BIC was GTR + G4. Bat sequences close related to MERS-CoVs are reported in colors: light blue for IT bat CoVs, pink for NeoCoV from South Africa, green for PREDICT from Uganda and red for SC2013 from China. Bat-CoV/H.savii/Italy/206645–40/2011 and Bat-CoV/P.khulii/Italy/206645–63/2011 sequences can be retrieved under accession numbers MG596802 and MG596803 respectively
Comparison between predicted protein sequences of the IT bat CoVs and prototype clade c betacoronaviruses and MERS related strains
| % Amino acid identitiesa | |||||||
|---|---|---|---|---|---|---|---|
| ORF | IT Bat CoVs | MERS-CoVb | HKU4c | HKU5d | EriCoVe | NeoCovf | BtVs-BetaCoV/SC2013g |
| ORF1ab | 99.4 | 81.3-81.5 | 73.5–73.6 | 75.9–76 | 74.5–74.7 | 81.6 | 84.6–84.7 |
| Spike | 99 | 68.5–68.8 | 70.4–70.6 | 73.8–74.3 | 57.8–58.1 | 60.9 | 79.6 |
| ORF3 | 97.1 | 48.5–49.5 | 39.6–40.7 | 46.5–47.5 | 28.6 | 51.5 | 53.4 |
| ORF4a | 97.9 | 53.7 | 44.2 | 50–52.2 | 43.2–43.4 | 54.7 | 67 |
| ORF4b | 98 | 43.9–46.4 | 29.9–30.3 | 31.1–32 | 39.9–40.4 | 47.4 | 56 |
| ORF5 | 98.7 | 64.3–64.7 | 47.6 | 54.5–55 | 52.9 | 62.9 | 74 |
| E | 100 | 86.6 | 70.7 | 69.5 | 78 | 91.5 | 91.5 |
| M | 99.5 | 84.9–85.8 | 81.7–82.1 | 82.1–82.6 | 83.9–84.4 | 84.9 | 87.2 |
| N | 99.5 | 81.4–81.9 | 73–73.4 | 71.8 | 74.4–74.6 | 83.7 | 83.4 |
| ORF8b | 96.9 | 63.4–67 | 49.2–52.4 | 49.5–52.1 | 47.9–49.5 | 63–65.1 | 64–65.6 |
| Concatenated domains | 99.3 | 79 | 74 | 74.6 | 74.4 | 79.2 | 82.3 |
aCalculated with MEGA7 using a pairwise deletion option
bGenBank accession number JX869059, KC164505, KC776174, KF186567, KF192507, KF600612, KF600620, KJ477102
cGenBank accession number EF065505, EF065506, EF065507, EF065508, DQ648794
dGenBank accession number EF065509, EF065510, EF065511, EF065512
eGenBank accession number KC545386, KC545383
fGenBank accession number KC869678
gGenBank accession number KJ473821
Fig. 3Maximum phylogenetic tree based on deduced amino acid sequences of S1 protein of beta CoVs. Best-fit model according to BIC was WAG + F + I + G4. Clades 2a, 2b, 2c and 2d corresponding to the sequences of human CoVs OC43, SARS-like, MERS-like and HKU9 CoV respectively are identified
Fig. 4Comparison of secondary and tertiary structures of MERS-like CoV RBD. a Predicted 3D MMS of the core and external subdomains of HKU4 RBD and human DPP4 (right and left lateral view). The two deletions observed in IT Bat CoVs RBD are evidenced in green. b Structure –based sequence alignment . The secondary structure elements are defined based on an ESPript algorithm and are labeled as in a previous report on the MERS RBD structure [24]. Spiral lines indicate helices, while arrows represent β strands. The external subdomain is highlighted by enclosure in a blue box. The two deletions found in IT Bat CoVs and HKU5 RBD are marked with blue lines. The Arabic numerals 1–4 indicate cysteine residues that pair to form disulfide bonds