Literature DB >> 31727697

Genome Sequences of Human Coronavirus OC43 and NL63, Associated with Respiratory Infections in Kilifi, Kenya.

Everlyn Kamau1, Martha M Luka2,3, Zaydah R de Laurent2, Irene Adema2, Charles N Agoti2, D James Nokes2,4.   

Abstract

Coding-complete genomes of two human coronavirus OC43 strains and one NL63 strain were obtained by metagenomic sequencing of clinical samples collected in 2017 and 2018 in Kilifi, Kenya. Maximum likelihood phylogenies showed that the OC43 strains were genetically dissimilar and that the NL63 strain was closely related to NL63 genotype B viruses.
Copyright © 2019 Kamau et al.

Entities:  

Year:  2019        PMID: 31727697      PMCID: PMC6856263          DOI: 10.1128/MRA.00730-19

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

Human coronaviruses (HCoVs) are enveloped viruses within the family Coronaviridae with positive-sense, single-stranded genomes of up to 32 kb, with a 5′ cap structure and 3′ polyadenylation tract (1). The 5′ two-thirds of the genome encodes the large replicases 1a and 1b encoding nonstructural proteins, whereas the 3′ one-third encodes the structural proteins (1). Coronaviruses are divided into four genera, namely, Alphacoronavirus (229E and NL63), Betacoronavirus (severe acute respiratory syndrome [SARS], Middle East respiratory syndrome [MERS], OC43, and HKU1), Gammacoronavirus, and Deltacoronavirus (2). The NL63, 229E, HKU1, and OC43 species are endemic in human populations, causing relatively mild and severe respiratory disease (2). Up to eight distinct HCoV-OC43 genotypes (3) and three HCoV-NL63 genotypes (4) have been identified. However, there is a paucity of HCoV genomes that have been collected globally, with a negligible number from Africa. Here, we report coding-complete genomes of two HCoV-OC43 strains and one NL63 strain. The isolates NL63_KLF_01_2018 and OC43_KLF_01_2018 were obtained from acute respiratory infection cases at a local school (from a 7-year-old boy with cough, nasal discharge, and abdominal pain and from a 13-year-old female student with cough, nasal discharge, and fever, respectively). OC43_KLF_02_2017 was isolated from a 2-month-old boy presenting at the Kilifi County Hospital with fever and chest in-drawing. Written informed consent for study participation was obtained from parents or guardians of the patients, and ethical approval was obtained from the KEMRI Scientific and Ethics Review Unit. Viral RNA was extracted from nasopharyngeal swabs using TRIzol LS reagent (Invitrogen) followed by TURBO DNase treatment (Invitrogen), according to the manufacturer’s instructions. cDNA was synthesized with SuperScript III reverse transcriptase (Invitrogen) with random hexamers, and double-stranded cDNA was synthesized with Klenow polymerase (5 U; New England BioLabs). Libraries were prepared using the Nextera XT kit (Illumina) according to the manufacturer’s instructions and sequenced using the MiSeq reagent kit v2 (500 cycle) (Illumina) on an Illumina MiSeq sequencer. Sequencing reads (paired, 2 × 250 bp) were filtered, kmer normalized, and de novo assembled as previously described (5) by using the settings specified in Table 1. Reference OC43 and NL63 genomes (GenBank accession numbers AY391777 and NC_005831, respectively) were used to transfer annotations to assembled contigs using Geneious (R9). MAFFT v.7.221 (6) was used for sequence alignment using the parameters “–localpair –maxiterate 1000.” Maximum likelihood phylogenies were estimated in RAxML v.8 (7) using the general time-reversible (GTR) nucleotide substitution model and gamma distribution of among site rate variation.
TABLE 1

Characteristics of the HCoV OC43 and NL63 genomes and samples from Kilifi, Kenya

IsolateGenome size (nt)G+C content (%)No. of readsb N50 (bp)c Avg depth (×)d No. of contigsrRT-PCRe CT valueGenBank accession no.
NL63_KLF_01_201827,12534.5754,45619,834348.59612720.88MN026166
OC43_KLF_01_201830,77736.6622,95424,566502.00924020.18MN026164
OC43_KLF_02_201730,31836.61,072,82627,724983.9882,21921.07MN026165

QUASR parameters, -d –q –l 125 –m 30; SPAdes parameters, —careful -k 77, 99, 127; QUAST parameters, minimum contig length, 500, ambiguity, one, and threshold for extensive misassembly size, 1,000, bowtie2 parameters, -q –S –local. nt, nucleotide; rRT-PCR, real-time reverse transcription-PCR; C, threshold cycle.

Short read length of the raw sequence data ranged from 35 to 250 bases.

N50 length is calculated by summing the lengths of contig assembly of a particular sample from the longest to the shortest and determining the point at which 50% of the assembly size is reached.

Calculated by dividing the per-position coverage output (described in the text) by respective genome length.

The rRT-PCR assay, including primers and probe sequences used for HCoV detection, has been described previously (9).

Characteristics of the HCoV OC43 and NL63 genomes and samples from Kilifi, Kenya QUASR parameters, -d –q –l 125 –m 30; SPAdes parameters, —careful -k 77, 99, 127; QUAST parameters, minimum contig length, 500, ambiguity, one, and threshold for extensive misassembly size, 1,000, bowtie2 parameters, -q –S –local. nt, nucleotide; rRT-PCR, real-time reverse transcription-PCR; C, threshold cycle. Short read length of the raw sequence data ranged from 35 to 250 bases. N50 length is calculated by summing the lengths of contig assembly of a particular sample from the longest to the shortest and determining the point at which 50% of the assembly size is reached. Calculated by dividing the per-position coverage output (described in the text) by respective genome length. The rRT-PCR assay, including primers and probe sequences used for HCoV detection, has been described previously (9). Characteristics of the three coronaviruses are listed in Table 1, and their genome organization was typical of NL63 and OC43 (8). Other viral contigs belonging to respiratory syncytial virus (RSV), adenovirus, influenza, and rhinovirus were identified using the BLASTn program, albeit in low frequencies (<0.1% of total contigs per sample). The two OC43 genomes had 98.4% (28,654 sites) pairwise identity but clustered in distinct branches of the genome-based phylogeny (Fig. 1A), while the NL63 sequence grouped with global NL63 genotype B sequences (Fig. 1B). Extensive variability, including insertions, deletions, and nonsynonymous substitutions, was observed in the S gene between Kilifi and other global strains for both NL63 and OC43 viruses. Furthermore, the number of codons and the pattern of putative N-glycosylation (N-X-S/T) varied in the spike protein between the two Kilifi OC43 strains. These new complete-coding genomes increase the available data from Africa and will be useful for future molecular epidemiology studies.
FIG 1

Maximum likelihood phylogenetic tree of 175 OC43 (A) and 56 NL63 (B) genome sequences. Reference and contemporary sequences were retrieved from GenBank. Sequences from Kilifi, Kenya, are colored red. Only nonoverlapping bootstrap values of >70 are shown in the phylogeny. Scale bars indicate expected nucleotide substitutions per site.

Maximum likelihood phylogenetic tree of 175 OC43 (A) and 56 NL63 (B) genome sequences. Reference and contemporary sequences were retrieved from GenBank. Sequences from Kilifi, Kenya, are colored red. Only nonoverlapping bootstrap values of >70 are shown in the phylogeny. Scale bars indicate expected nucleotide substitutions per site.

Data availability.

The raw sequence data were deposited in the Sequence Read Archive (SRA) under BioProject accession number PRJNA547576 and BioSample accession numbers SAMN11969662, SAMN11969663, and SAMN11969664. The genome sequences generated here were deposited in GenBank under accession numbers MN026164, MN026165, and MN026166.
  9 in total

1.  Added value of an oropharyngeal swab in detection of viruses in children hospitalized with lower respiratory tract infection.

Authors:  Laura L Hammitt; Sidi Kazungu; Steve Welch; Anne Bett; Clayton O Onyango; Rory N Gunson; J Anthony G Scott; D James Nokes
Journal:  J Clin Microbiol       Date:  2011-04-13       Impact factor: 5.948

Review 2.  Coronavirus pathogenesis and the emerging pathogen severe acute respiratory syndrome coronavirus.

Authors:  Susan R Weiss; Sonia Navas-Martin
Journal:  Microbiol Mol Biol Rev       Date:  2005-12       Impact factor: 11.056

3.  MAFFT multiple sequence alignment software version 7: improvements in performance and usability.

Authors:  Kazutaka Katoh; Daron M Standley
Journal:  Mol Biol Evol       Date:  2013-01-16       Impact factor: 16.240

4.  Genomic analysis of 16 Colorado human NL63 coronaviruses identifies a new genotype, high sequence diversity in the N-terminal domain of the spike gene and evidence of recombination.

Authors:  Samuel R Dominguez; Gregory E Sims; David E Wentworth; Rebecca A Halpin; Christine C Robinson; Christopher D Town; Kathryn V Holmes
Journal:  J Gen Virol       Date:  2012-07-25       Impact factor: 3.891

5.  RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies.

Authors:  Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2014-01-21       Impact factor: 6.937

Review 6.  Epidemiology, Genetic Recombination, and Pathogenesis of Coronaviruses.

Authors:  Shuo Su; Gary Wong; Weifeng Shi; Jun Liu; Alexander C K Lai; Jiyong Zhou; Wenjun Liu; Yuhai Bi; George F Gao
Journal:  Trends Microbiol       Date:  2016-03-21       Impact factor: 17.079

7.  A novel human coronavirus OC43 genotype detected in mainland China.

Authors:  Yun Zhu; Changchong Li; Li Chen; Baoping Xu; Yunlian Zhou; Ling Cao; Yunxiao Shang; Zhou Fu; Aihuan Chen; Li Deng; Yixiao Bao; Yun Sun; Limin Ning; Chunyan Liu; Ju Yin; Zhengde Xie; Kunling Shen
Journal:  Emerg Microbes Infect       Date:  2018-10-30       Impact factor: 7.163

8.  Complete Genome Sequences of Dengue Virus Type 2 Strains from Kilifi, Kenya.

Authors:  Everlyn Kamau; Charles N Agoti; Joyce M Ngoi; Zaydah R de Laurent; John Gitonga; Matthew Cotten; My V T Phan; D James Nokes; Eric Delwart; Eduard Sanders; George M Warimwe
Journal:  Microbiol Resour Announc       Date:  2019-01-24

Review 9.  Recently discovered human coronaviruses.

Authors:  Brigitte A Wevers; Lia van der Hoek
Journal:  Clin Lab Med       Date:  2009-12       Impact factor: 1.935

  9 in total
  1 in total

1.  Transmission and evolutionary dynamics of human coronavirus OC43 strains in coastal Kenya investigated by partial spike sequence analysis, 2015-16.

Authors:  Carol A Abidha; Joyce Nyiro; Everlyn Kamau; Osman Abdullahi; David James Nokes; Charles N Agoti
Journal:  Virus Evol       Date:  2020-06-02
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.