| Literature DB >> 32523779 |
Carol A Abidha1,2,3, Joyce Nyiro2, Everlyn Kamau2, Osman Abdullahi1,2, David James Nokes2,4, Charles N Agoti1,2.
Abstract
Human coronavirus OC43 (HCoV-OC43) is a major contributor to seasonal outbreaks of acute respiratory illness (ARI). The origins of locally circulating HCoV-OC43 strains and characteristics of their genetic diversity are unknown for most settings despite significance to effective HCoV control strategies. Between December 2015 and June 2016, we undertook ARI surveillance in coastal Kenya in nine outpatients and one inpatient health facility (HF). Ninety-two patient samples tested HCoV-OC43 positive and forty (43.5%) were successfully sequenced in spike (S) gene region (2,864 long, ∼70%). Phylogenetic analysis confirmed co-circulation of two distinct HCoV-OC43 clades that closely clustered with genotype G (n = 34, 85%) and genotype H (n = 6, 15%) reference strains. Local viruses within the same clade displayed low genetic diversity yielding identical sequences in multiple HF. Furthermore, the newly sequenced Kenyan viruses showed close phylogenetic relationship to other contemporaneous sampled strains (2015-16) including those originating from distant places (e.g. USA and China). Using a genetic similarity threshold of 99.1 per cent at nucleotide level, the HCoV-OC43 strains sampled globally between 1967 and 2019 fell into nine sequence clusters. Notably, some of these clusters appeared to have become extinct, or occurred only sporadically in a few geographical areas while others persisted globally for multiple years. In conclusion, we found that HCoV-OC43 strains spread rapidly both locally and across the globe with limited genetic evolution in the spike gene. Full-genome sequences that are spatio-temporally representative are required to advance understanding of the transmission pathways of this important human respiratory pathogen.Entities:
Keywords: HCoV-OC43; Kenya; epidemiology; evolutionary dynamics; spike
Year: 2020 PMID: 32523779 PMCID: PMC7266483 DOI: 10.1093/ve/veaa031
Source DB: PubMed Journal: Virus Evol ISSN: 2057-1577
Figure 1.Geographic map of the KHDSS area showing spatial distribution of the enrolled health facilities.
Figure 2.HF and temporal distribution of HCoV-OC43 positive and spike gene sequenced samples. (a) The number of detected HCoV-OC43 by month and HF. Circle size is proportional to the number of samples (smallest is one and largest is ten samples). (b) The detections of HCoV-OC43 across seven surveillance months (December 2015–June 2016) stratified by sequencing status. (c) A boxplot showing the diagnostic real-time RT-PCR cycle threshold distribution for samples that successfully sequenced in the spike gene versus those that failed. (d) The number of HCoV-OC43 samples by month and HF that we obtained their spike sequence. Circle size is proportional to the number of samples (smallest is one and largest is five samples).
Baseline characteristics of the samples that were HCoV-OC43 positive and by sequencing status, from a study of ten health facilities in coastal Kenya, 2015–16.
| Characteristic | All | Sequenced | Failed sequencing |
|
|---|---|---|---|---|
| Total samples | 92 | 40 (43.5%) | 52 (56.5%) | |
| Facility#,a | 0.604 | |||
| Inpatient | 6 (6.5%) | 2 (5.0%) | 4 (7.7%) | |
| Outpatient | 86 (93.5%) | 38 (95.0%) | 48 (92.3%) | |
| Gender | 0.956 | |||
| Male | 44 (47.5%) | 19 (47.5%) | 25 (48.1%) | |
| Female | 48 (52.2%) | 21 (52.5%) | 27 (51.9%) | |
| Age class | 0.0695 | |||
| <1 y | 20 (21.7%) | 7 (17.5%) | 13 (25.0%) | |
| 1–4 years | 41 (44.6%) | 20 (50.0%) | 21 (40.4%) | |
| 5–17 years | 16 (17.4%) | 10 (25.0%) | 6 (11.5%) | |
| ≥18 years | 15 (16.3%) | 3 (7.5%) | 12 (23.1%) | |
| Ct value | ||||
| Mean (SD) | 26.0 (4.4) | 24.0 (3.4) | 27.6 (4.4) | < 0.001 |
| Median (IQR) | 25.9 (23.0–29.6) | 24.3 (21.3–25.8) | 29.01 (25.2–30.8) | < 0.001 |
Only one inpatient facility was included (Kilifi County Hospital, KCH) and these monitored pediatric admissions only while for the outpatient, nine health facilities were enrolled and patients of any age were eligible except neonates. P values were derived from
pearson χ2 test
two sample t-test
Wilcoxon rank-sum (Mann-whitney) test
SD stands for standard deviation
Figure 3.ML phylogenies showing the context of the Kilifi HCoV-OC43 sequenced strains. All panels included the same 323 spike sequences spanning ∼70 per cent of full spike length (283 collated from GenBank and forty that were sequenced in this study). (a) Relationship between the local sequences (Kilifi) to the global comparison dataset and the reference sequences for genotypes A–H. (b) Relationship between sequences sampled across different continents. (c) The relationship between sequences sampled across different time periods. (d) Relationship between the non-Kilifi sequences to those from Kilifi by HF.
Figure 4.Spike nucleotide diversity of the local and global HCoV-OC43 strains and spatio-temporal patterns of identified clusters. (a) Pairwise nt difference distribution between the forty sequenced Kilifi samples across the 2,864 nt long spike region. (b) The distribution of the pairwise nucleotide differences between the combined global 323 spike sequences. (c) The specific countries in which the nine assigned clusters were identified. (d) The temporal distribution of the nine clusters between 1960 and 2019 among the 323 global dataset that we analysed.
Figure 5.Phylogeny of the local OC43 strains, spatial and temporal distribution of the identified clusters. (a) A ML phylogenetic tree based on the Kilifi spike sequences (n = 40). Numbers next to nodes indicate the bootstrap support and only those >70 per cent are shown. Kilifi sequences are preceded with a circle coloured by HF. (b) A minimum spanning network to demonstrate potential transmission links between the Kilifi strains. The vertexes represent the sequenced spike gene haplotypes. The size of the vertex is proportional to the number of haplotypes (identical sequences) and is coloured by the HF, from which the sequenced sample was collected. The numbers shown on the edges represent the number of nt changes from one vertex (haplotype) to the next. (c) The proportion of Kilifi samples classifying as genotype G or H by HF. (d) The proportion of Kilifi samples that classified into either G or H genotype across the 7 months (December 2015–June 2016).
Figure 6.Multiple sequence alignments of the HCoV-OC43 strains we sequenced (n = 40). Genotypes G and H strains are separated and are each compared to a reference strain of the genotype identified in GenBank. The multi-coloured bar on the upper panel of the plot shows the relative positions of the spike protein domains in the alignment. The assigned genotype is indicated on the right-hand side of the lower panel plot. The coloured vertical small bars on the lower panel show the nt changes from the reference sequence. Red is a change to T, green is a change to A, black is a change to G, and medium blue is a change to C. Dark grey is a change to a deletion. Viruses within the genotypes are ordered by sampling date.