| Literature DB >> 28050020 |
Xiang Yong Oong1, Kim Tien Ng2, Yutaka Takebe1,3, Liang Jie Ng4, Kok Gan Chan5, Jack Bee Chook2, Adeeba Kamarulzaman2, Kok Keng Tee1.
Abstract
Human coronavirus OC43 (HCoV-OC43) is commonly associated with respiratory tract infections in humans, with five genetically distinct genotypes (A to E) described so far. In this study, we obtained the full-length genomes of HCoV-OC43 strains from two previously unrecognized lineages identified among patients presenting with severe upper respiratory tract symptoms in a cross-sectional molecular surveillance study in Kuala Lumpur, Malaysia, between 2012 and 2013. Phylogenetic, recombination and comparative genomic analyses revealed two distinct clusters diverging from a genotype D-like common ancestor through recombination with a putative genotype A-like lineage in the non-structural protein (nsp) 10 gene. Signature amino acid substitutions and a glycine residue insertion at the N-terminal domain of the S1 subunit of the spike gene, among others, exhibited further distinction in a recombination pattern, to which these clusters were classified as genotypes F and G. The phylogeographic mapping of the global spike gene indicated that the genetically similar HCoV-OC43 genotypes F and G strains were potentially circulating in China, Japan, Thailand and Europe as early as the late 2000s. The transmission network construction based on the TN93 pairwise genetic distance revealed the emergence and persistence of multiple sub-epidemic clusters of the highly prevalent genotype D and its descendant genotypes F and G, which contributed to the spread of HCoV-OC43 in the region. Finally, a more consistent nomenclature system for non-recombinant and recombinant HCoV-OC43 lineages is proposed, taking into account genetic recombination as an important feature in HCoV evolution and classification.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28050020 PMCID: PMC5285497 DOI: 10.1038/emi.2016.132
Source DB: PubMed Journal: Emerg Microbes Infect ISSN: 2222-1751 Impact factor: 7.163
Demographic and clinical profile of patients infected with HCoV-OC43 lineages 1 and 2
| Lineage 1 | MY-U208/12 | 28 March 2012 | 61 | F | Sneezing, nasal discharge, sore throat, hoarseness of voice, cough |
| MY-U413/12 | 2 May 2012 | 72 | F | Nasal discharge | |
| MY-U464/12 | 9 May 2012 | 38 | M | Nasal congestion, headache, sore throat, hoarseness of voice, cough | |
| MY-U523/12 | 18 May 2012 | 74 | M | Sneezing, cough | |
| MY-U732/12 | 25 June 2012 | 53 | F | Sneezing, nasal discharge, headache, cough | |
| MY-U868/12 | 16 July 2012 | 59 | M | Nasal congestion, cough | |
| MY-U945/12 | 1 August 2012 | 11 | M | Sneezing, nasal discharge, nasal congestion, headache | |
| MY-U1024/12 | 24 August 2012 | 61 | F | Nasal congestion | |
| MY-U1140/12 | 10 September 2012 | 21 | M | Sneezing, nasal discharge, sore throat, hoarseness of voice | |
| Lineage 2 | MY-U002/12 | 22 February 2012 | 71 | F | Nasal congestion, headache, cough |
| MY-U236/12 | 2 April 2012 | 19 | M | Nasal discharge, nasal congestion, headache | |
| MY-U710/12 | 20 June 2012 | 50 | F | Sneezing, nasal discharge, nasal congestion, cough | |
| MY-U774/12 | 2 July 2012 | 32 | F | Sneezing, nasal congestion, headache, sore throat, hoarseness of voice, cough | |
| MY-U1057/12 | 27 August 2012 | 58 | F | Sore throat, hoarseness of voice, cough | |
| MY-U1758/13 | 2 January 2013 | 56 | M | Sneezing, nasal discharge, nasal congestion, headache, sore throat, cough | |
| MY-U1975/13 | 15 February 2013 | 52 | F | Sneezing, headache, hoarseness of voice, cough | |
Abbreviations: Female, F; Male, M.
Figure 1(A) Phylogenetic analysis of the HCoV-OC43 strains based on the full-length genome. Trees were reconstructed using the neighbor-joining method and Kimura 2-parameter model in MEGA 6.0. Bootstrap values were calculated from 1000 trees. Bootstrap values >70% were indicated on the branch nodes. The scale bar of an individual tree indicates the substitutions per site. (B) Estimation of pairwise genetic distances between HCoV-OC43 genotypes based on the full-length genome sequences. Genotypes F and G were previously classified as lineages 1 and 2, respectively.[20]
Figure 2(A) Comparison of the mosaic recombination structure of the full-length genome between genotypes D, F and G. Bootscan analysis was performed using published and genotyped reference genomes for genotypes A, B, C and E as putative parental genotypes. (B) Putative parental genotype determination and confirmation in sub-genomic regions (sub-regions I–V) of genotypes D, F and F recombinants using the neighbor-joining method. Breakpoints determined by informative site analysis. Red, green and blue shades indicate genotypes A, B and C putative parental genotypes, respectively. Numbering of nucleotide (nt) positions is based on prototype ATCC VR759 reference strain.
Figure 3Signature nucleotide and amino acid substitution differences across the whole genome between genotypes F and G strains. Nucleotide and amino acid positions are numbered with a reference to HCoV-OC43 prototype strain ATCC VR759.
Figure 4Evidence from (A) amino acid substitutions and (B) bootscan analysis on the partial S gene data to confirm the presence of genotypes F and G strains. The amino acid sequence and bootscan analyses are performed on the partial S gene region (23 644–25 125 nt). Bootscan analysis is performed with a window and step size of 300 and 10 bp, respectively.
Figure 5(A) Maximum clade credibility (MCC) tree of HCoV-OC43 strains based on the 114 complete and partial global S gene data. MCC posterior probability values were indicated on the nodes of each genotype. (B) Global distribution of genotypes F and G strains.
Time of most recent common ancestor (tMRCA) for HCoV-OC43 genotypes A to G estimated based on the spike (S) gene and full-length genome
| A | 1963.8 (1959.4–1966.7) | 1964.7 (1960.3–1966.8) |
| B | 1999.1 (1996.6–2001.5) | 1999.0 (1993.5–2002.6) |
| C | 1997.2 (1994.0–1999.7) | 1998.0 (1994.3–2000.5) |
| D | 1997.3 (1994.2–2000.4) | 1999.3 (1996.5–2001.5) |
| E | 2007.9 (2005.9–2009.4) | 2003.3 (1999.1–2006.9) |
| F | 2010.8 (2010.0–2011.4) | 2009.8 (2008.1–2011.2) |
| G | 2010.7 (2010.0–2011.4) | 2010.1 (2008.5–2011.4) |
Abbreviation: highest posterior density, HPD.
Figure 6Transmission network of HCoV-OC43 genotypes D, F and G. Transmission clusters were inferred from 86 S gene sequences based on the Tamura-Nei 93 (TN93)[29] pairwise distance estimates performed using a custom script in Python (release 3.2.6) with 1000 bootstrap replicates.