Literature DB >> 33201914

Genetic diversity of SARS-CoV-2 and clinical, epidemiological characteristics of COVID-19 patients in Hanoi, Vietnam.

Tam Thi Nguyen1, Thach Ngoc Pham2, Trang Dinh Van2, Trang Thu Nguyen1, Diep Thi Ngoc Nguyen1, Hoa Nguyen Minh Le2, John-Sebastian Eden3,4, Rebecca J Rockett3,5, Thuong Thi Hong Nguyen1, Bich Thi Ngoc Vu1, Giang Van Tran6,7, Tan Van Le8, Dominic E Dwyer3,5,9, H Rogier van Doorn1,10.   

Abstract

A second cluster of COVID-19 cases imported from Europe occured in Vietnam from early March 2020. We describe 44 SARS-CoV-2 RT-PCR positive patients (cycle threshold value <30) admitted to the National Hospital for Tropical Diseases in Hanoi between March 6 and April 15 2020. Whole SARS-CoV-2 genomes from these patients were sequenced using Illumina Miseq and analysed for common genetic variants and relationships to local and globally circulating strains. Results showed that 32 cases were Vietnamese with a median age of 37 years (range 15-74 years), and 23 were male. Most cases were acquired outside Vietnam, mainly from the UK (n = 15), other European countries (n = 14), Russia (n = 6) and countries in Asia (n = 3). No cases had travelled from China. Forty-one cases had symptoms at admission, typically dry cough (n = 36), fever (n = 20), sore throat (n = 14) and diarrhoea (n = 12). Hospitalisation was long with a median of 25 days, most commonly from 20-29 days. All SARS-CoV-2 genomes were similar (92-100% sequence homology) to the reference sequence Wuhan_1 (NC_045512), and 32 strains belonged to the B.1.1 lineage. The three most common variants were linked, and included C3037T, C14408T (nsp12: P323L) and A23403G (S: D614G) mutations. This group of mutations often accompanied variant C241T (39/44 genomes) or GGG 28881..28883 AAC (33/44 genomes). The prevalence of the former reflected probable European origin of viruses, and the transition D614G was dominant in Vietnam. New variants were identified; however, none could be associated with disease severity.

Entities:  

Mesh:

Year:  2020        PMID: 33201914      PMCID: PMC7671498          DOI: 10.1371/journal.pone.0242537

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Coronavirus Disease 2019 (COVID-19) is caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and has spread to 218 countries and territories worldwide, leading to more than 42,500,000 confirmed cases with 1,147,301 deaths by October 25 2020 [1]. The number of deaths due to COVID-19 has increased significantly since April 2020, with between 3000 to 6000 deaths daily [2]. SARS-CoV-2 belongs to family Coronaviridae, genus Betacoronavirus and subgenus Sarbecovirus, and is one of seven coronaviruses causing human disease—NL63, 229E, OC43, HKU1, SARS-CoV and Middle East Respiratory Syndrome CoV (MERS-CoV) [3]. The SARS-CoV-2 genome has about 89% nucleotide sequence similarity with SARS-like CoVs (bat SL-CoVZC45, SARS-CoV Tor2) and is divided into two untranslated terminal regions (UTR) and 14 open reading frames (ORFs). The structural ORFs consist of Spike (S), Envelope (E), Membrane (M) and Nucleocapsid (N) proteins, while other regions code for non-structural or accessory proteins [4]. On January 23 2020, the Ministry of Health in Vietnam reported the first imported case of SARS-CoV-2 infection, someone who had recently travelled from Wuhan in Hubei Province, China. A total of 16 cases were recorded in this first cluster, but more SARS-CoV-2 importation and transmission in Vietnam happened from March 2020 onward, when residents and tourists entered Vietnam from Europe, USA and elsewhere in Asia [5]. Notably, local community transmission was successfully suppressed for 99 days from the April 15 to July 24 2020 [5]. The cases recently detected in Da Nang have triggered the third cluster of COVID-19 in Vietnam, increasing local and hospital acquired transmission and deaths. There are a number of reports describing the clinical, epidemiological and genetic features of COVID-19 cases and SARS-CoV-2 in Vietnam, although whole genome sequencing (WGS) data is limited [6-18]. Here, we report the clinical, epidemiological and WGS features and correlations of 44 patients with COVID-19 in Vietnam.

Materials and methods

Samples collection and molecular diagnosis

Nasopharyngeal and oropharyngeal swabs of all suspected COVID-19 cases quarantined or hospitalised/isolated at the National Hospital for Tropical Diseases (NHTD) were collected using sterile cotton buds and stored in in-house produced viral transport medium. Samples were tested within 24 hours of collection. Total RNA was extracted with the QIAamp viral RNA mini kit (Qiagen, Hilden, Germany) and tested with a real-time reverse transcription PCR (real-time RT-PCR) to check for the presence of the E gene (112bp) and RNA-dependent RNA polymerase (RdRp) gene (99bp) according to the World Health Organization’s protocol using SuperScript III Platinum One-Step qRT-PCR kit (Invitrogen, Carlsbad, CA USA) and E-Sarbeco, RdRp primers and probes (Tib Molbiol, Berlin, Germany) [19]. Positive samples with cycle threshold (Ct) values <30 for both genes were stored at -80°C for sequencing.

cDNA synthesis, PCR amplification and whole genome sequencing

We obtained 44 real-time RT PCR positive samples (Ct < 30) diagnosed from March 6 to April 15 2020, and SARS-CoV-2 WGS was performed as described elsewhere [20, 21]. Briefly, RNA was converted into cDNA, then amplified into 14 PCR products spanning the SARS-CoV-2 reference genome (Accession: MN 908947-Wuhan-Hu-1). PCR products (about 2.5kb) were checked by electrophoresis using a 1% agarose gel running at 110V for 30 minutes. PCR fragments of each sample were then pooled and purified by AMPure XP beads (Beckman Coulter, USA). Library preparation used the Nextera XT Library preparation kit (Illumina, USA), and sequencing was performed on an Illumina Miseq platform with the 300 cycle v2 or 300 cycle micro v2 kit (Illumina, USA) following manufacturer’s instructions.

Collection of clinical and epidemiological information

Real-time RT PCR positive patients were admitted to the NHTD, the first-line national hospital for COVID-19 treatment in northern Vietnam. Medical staff recorded patient demographics, travel history and contacts within the previous 14 days. Admission signs, symptoms and health history were obtained by direct patient communication, and followed up during hospitalisation. Treatments, chest radiography or computed tomography (CT) and clinical outcomes of 44 patients were recorded. This study was approved by the Institutional Review Board of the National Hospital for Tropical Diseases, Hanoi, Vietnam. The need for participant consent was waived by the Ethics Committee of the National Hospital for Tropical Diseases (Decision No.02A/HDDD-NDTU issued on March 30 2020).

Data analysis

Consensus assembly and variant detection

A quality control, assembly and variant calling workflow for Illumina data with the Nextera XT library preparation kit was used [22]. The workflow performed on CLC genomics workbench version 20.0.3 included raw reads firstly trimmed based on read quality and to remove adapters. Trimmed reads were mapped to a data set of eight SARS-CoV-2 genomes from Vietnam in March 2020 submitted to GISAID (https://www.gisaid.org/). This resulted in reads specific to SARS-CoV-2 and non-SARS-CoV-2 reads which primarily mapped to the human reference genome (GCA_000001405.15_GRCh38). Viral reads were mapped to the Wuhan SARS-CoV-2 reference genome (NC_045512) producing a consensus sequence and calling sequence variants of 44 viral isolates. The workflow finally created reports (tables or graphic files) and a track list to summarise results of all steps. Based on variant/amino acid change tables, we filtered true mutations which met a coverage of >30x and greater than 70% of total reads carrying the modification. Novel variants and deletions were confirmed by Sanger sequencing. The viral consensus sequences were submitted to the GLUE website (http://cov-glue.cvr.gla.ac.uk) to confirm whether any mutations were novel compared to the GISAID database. We combined the mutation report from GLUE with CLC genomics workbench to remove modifications that did not satisfy the threshold for mutation selection. QualiMap application [23] was used to evaluate sequencing depth across the reference genome and calculate mean coverage values.

Phylogenetic analysis of study sequences in global context

The study sequences were genotyped using the Pangolin [24] web server (https://pangolin.cog-uk.io/), which revealed that all 44 viruses were of the SARS-CoV-2 B lineage. To examine the diversity and potential sources, study sequences were compared to a random selection of 1000 globally circulating B lineage viruses and other Vietnamese sequences (41 sequences including 5 A lineage viruses) available on GISAID with collection dates reflecting the study period between December 1st 2019 and April 15th 2020. All sequences were first aligned against the prototype strain Wuhan-Hu-1/2019 (GenBank accession NC_045512) with MAFFT [25] before sequences with excessive ambiguities (>400 sites) and divergence (>25 SNPs from prototype) were removed along with those without a specific day of sample collection. The refined alignment was then analysed phylogenetically using a maximum likelihood approach in RAxML [26] with the GTR+G substitution model and 1000 bootstrap replicates. The final tree was visualised and annotated with FigTree [27]. Cluster identification was based on the phylogenetic analysis showing monophyletic clades with a bootstrap value greater than 70.

Metadata analysis

Metadata distributed in range and continuously, and their mean and standard deviation (SD) values were reported. Other categorical variables were counted and percentage given per total sample. Microsoft Excel was used to statistically analyze these data.

Results

Demographic and epidemiological features of patients with COVID-19 in Vietnam

This study enrolled 44 patients who were selected for inclusion based on a positive SARS-CoV-2 real-time RT PCR result with Ct values of E and RdRp gene <30. The median age of all patients was 37 years (range 15–74), with the 20–39 years age group the most prevalent, accounting for 20 cases (Table 1). Thirty-two patients were of Vietnamese nationality, and the imported cases had travelled from United Kingdom (n = 15), Russia (n = 6), Germany (n = 5), France (n = 4), Italy (n = 2), Spain (n = 2), Netherlands (n = 1) and countries in Asia (n = 3). The male:female ratio was 23:21.
Table 1

Demographic and epidemiological characteristics of laboratory-confirmed COVID-19 cases in Vietnam.

Cases (44)Cases (44)
n (%)n (%)
GenderSource of transmission
 Male23 (52) Contact with COVID-19 case in Vietnam12 (27)
 Female21 (48) International travel in past 14 days25 (57)
Age Both5 (11)
 Mean (SD)37.36 (16.82) Unknown2 (5)
 Range15–74International travel from country
 0–90 China0
 10–196 (14) India1 (2)
 20–3920 (45) Singapore (transit)1 (2)
 40–5912 (27) Japan1 (2)
 60–796 (14) UK15 (34)
 ≥800 France4 (9)
Nationality Germany5 (11)
 Vietnamese32 (72) Italy2 (5)
 UK7 (16) Russia6 (13)
 French3 (7) Spain2 (5)
 German2 (5) Netherlands1 (2)
In terms of epidemiology, the majority of patients were imported cases (25/44) entering Vietnam from March to April 2020. These cases included returning travellers from Europe, especially the UK. There were no cases from China. While most imported cases (n = 17) were quarantined immediately, 8 cases had contact with relatives and community before SARS-CoV-2 detection and isolation at the NHTD. There were 12 cases infected following contact with positive cases in Vietnam, another five cases were acquired from either international sources or other known contacts, whereas for two cases contact details were unavailable (Table 1).

Clinical features and treatment

According to the WHO definition [28], we identified symptomatic cases as laboratory-confirmed people displaying any symptoms or signs of respiratory viral infection at admission. Laboratory-confirmed cases not having any symptoms at diagnosis and at 14 day follow-up were classified as asymptomatic. Pre-symptomatic cases were defined as a laboratory-confirmed case who did not have symptoms at diagnosis but developed clinical symptoms within 14 days. Forty-one cases were symptomatic with dry cough (n = 36), fever (n = 20), sore throat (n = 14), diarrhoea (n = 12); there were a few cases with headache, difficulty breathing, muscle aches and fatigue (Table 2). Most patients did not have underlying chronic medical diseases, but 10 had co-morbidities including cancer, diabetes, asthma, hypertension, chronic kidney disease, cerebrovascular disease, gout, vestibular disorder, heart failure or gastroesophageal reflux.
Table 2

Clinical characteristics and treatment of 44 patients with COVID-19 admitted to the National Hospital for Tropical Diseases.

Cases (44)Cases (44)
n (%)n (%)
Clinical features and history
Comorbidities
 Symptomatic41 (93) Asthma1 (2)
 Asymptomatic3 (7) Diabetes3 (7)
Symptoms Hypertension3 (7)
 Fever20 (45) Cancer2 (5)
 Dry cough36 (82) Chronic kidney disease1 (2)
 Sputum production10 (23) Cerebrovascular disease1 (2)
 Headache5 (11) Other6 (14)
 Nasal congestion5 (11)Imaging
 Sore throat14 (32) Chest X-ray abnormalities30 (68)
 Fatigue10 (23) Chest CT abnormalities34 (77)
 Shortness of breath6 (14)Complications
 Diarrhoea12 (27) Acute respiratory distress syndrome5 (11)
 Myalgia6 (14) Pneumonia34 (77)
 Other2 (5) Other2 (5)
Course of Treatment
Hospital stay (days)Treatment received
 Mean (SD)25.34 (14.95)Intravenous antibiotics11 (25)
 <105 (11)Antifungal medications3 (7)
 10–1910 (23)Systemic glucocorticoids3 (7)
 20–2917 (39)Oxygen therapy without mechanical ventilation6 (14)
 30–398 (18)Invasive ventilation (intubation)4 (9)
 40–491 (2)Extracorporeal membrane oxygenation1 (2)
 50–591 (2)Renal dialysis2 (5)
 60–690Intravenous Immunoglobulin5 (11)
 70–792 (5)
 > = 800
There were 34 cases diagnosed with pneumonia based on chest X-ray or CT abnormalities (Table 2): one case had an asymptomatic pneumonia. Notably, four patients developed critical illness with respiratory failure requiring initial oxygen supplementation, invasive intubation (four cases, 9%), with one switched to extracorporeal membrane oxygenation (Table 2 and S1 Table). Only one of the four critically ill patients did not have any underlying chronic disease (S1 Table). Critically ill patients were moved to the intensive care unit (ICU), and transferred to another department when recovering. In terms of the treatment course, 11 patients were treated with antibiotics, and three of the ICU patients received antifungals and glucocorticoids. Apart from the four critical cases, two patients required non-mechanical oxygenation. Two patients needed renal dialysis, one of whom had a history of chronic kidney disease. Intravenous immunoglobulin (IVIG) was given to five patients (Table 2). Treatment duration was around 25 days, with 17 cases between 20–29 days. Five cases needed under 10 days to recover, while two cases stayed in hospital for up to 79 days. All patients recovered and were discharged home.

Assembly and phylogenetic analysis of SARS-CoV-2 sequences

Forty-four complete SARS-CoV-2 sequences were produced using an amplicon-based approach. For each sequence library, 92–100% of paired end reads mapped to the reference genome sequence (NC_045512). Lengths varied between 29,777 and 29,872 base pairs (bp) at sequencing depths (average coverage across the whole reference genome) greater than 2000x (Table 3). All sequences were submitted to GISAID (https://www.gisaid.org/) with accession numbers listed in S2 Table. When compared to all GISAID sequences, our sequences fell into six lineages including B.1, B.1.1, B.1.1.1, B.2, B.2.1 and B.3, with the dominant lineage being B.1.1 (32/44) (Fig 1). Among the 32 B.1.1 viruses, eleven were isolated from patients who were contacts of positive cases in the community, four were from patients reporting both international travel (from UK and Germany) and contact with positive cases, 16 were from imported cases (UK six cases, Germany two, Russia three, and one case each from Italy, France, Netherlands, Spain and Japan). Two cases with unknown transmission source had B.1 and B.1.1 lineage sequences (S3 Table).
Table 3

Genome assembly of 44 SARS-CoV-2 sequences in Vietnam.

Patient IDSample IDGenome assemblyCoverage (mean)
Total readsReads mapped to reference (%)Sequence length (bp)
BN1VNHN_0148683,04099.10297778041x
BN2VNHN_0022821,17499.43298574000x
BN3VNHN_00261,654,83099.80298248200x
BN4VNHN_02071,278,94496.40298234644x
BN5VNHN_03012,923,36099.632980614205x
BN6VNHN_0300686,52299.80298303400x
BN7VNHN_03021,427,07499.57298267018x
BN8VNHN_02991,634,46299.84298288041x
BN9VNHN_0418809,04498.21298613111x
BN10VNHN_0419698,91299.09298472786x
BN11VNHN_05541,029,72495.29298583716x
BN12VNHN_07641,905,20299.81298239362x
BN13VNHN_07622,000,16899.77298299823x
BN14VNHN_08471,221,39498.60298565950x
BN15VNHN_08371,209,45691.86298244227x
BN16VNHN_0899861,51897.83298213148x
BN17VNHN_1072763,97298.16298442824x
BN18VNHN_1097754,57498.81298262999x
BN19VNHN_09851,590,20499.40298277800x
BN20VNHN_0897656,04699.35298452604x
BN21VNHN_10981,781,05299.83298218778x
BN22VNHN_1099670,40499.81298233320x
BN23VNHN_0979633,11099.87298083081x
BN24VNHN_1166988,24297.73298203621x
BN25VNHN_1167505,34999.64298212491x
BN26VNHN_1528703,28999.91298483448x
BN27VNHN_1226696,21499.90298233422x
BN28VNHN_1492597,05399.89298223654x
BN29VNHN_2406597,24099.95298322937x
BN30VNHN_17131,517,25496.84298245515x
BN31VNHN_1863538,02299.95298262660x
BN32VNHN_3096625,69299.87298423031x
BN33VNHN_3085436,92699.93298202162x
BN34VNHN_3916432,00299.96298242153x
BN35VNHN_3913640,56999.69298223156x
BN36VNHN_3629596,63699.96298232952x
BN37VNHN_4189579,38799.96298262862x
BN38VNHN_4864697,29499.98298263458x
BN39VNHN_4958920,72399.82298724532x
BN40VNHN_4851766,58499.94298283794x
BN41VNHN_4868536,94599.94298242639x
BN42VNHN_4806769,13799.86298553776x
BN43VNHN_4875681,67099.30298252822x
BN44VNHN_5152708,09097.31298192587x
Fig 1

Phylogenetic analysis of SARS-CoV-2 genomes in Vietnam.

An alignment of 1056 SARS-CoV-2 sequences including 44 from this study and 41 other Vietnamese sequences was examined using a maximum likelihood approach. Sequences have been coloured according to the provided key and relevant SARS-CoV-2 lineages are indicated. The scale represents the number of substitutions per site.

Phylogenetic analysis of SARS-CoV-2 genomes in Vietnam.

An alignment of 1056 SARS-CoV-2 sequences including 44 from this study and 41 other Vietnamese sequences was examined using a maximum likelihood approach. Sequences have been coloured according to the provided key and relevant SARS-CoV-2 lineages are indicated. The scale represents the number of substitutions per site. Although our sequences were mostly from B.1 viruses, and particularly the B.1.1 lineage, the sequences were relatively dispersed across the phylogeny. There were two notable clusters observed within the B.1.1 lineage: cluster 1 included three samples (VNHN_5152, VNHN_4875, VNHN_4868) and cluster 2 consisted of five samples (VNHN_3085, VNHN_3913, VNHN_3916, VNHN_4864, VNHN_4958) (S1 Fig). Cluster 1 sequences differed from each other at one SNP or one deletion of three nucleotides, while four sequences in cluster 2 were identical and another sequence had a 6-nucleotide deletion compared to others. Interestingly, all samples in cluster 1 were isolated from locally transmitted cases in a local hotspot in Vietnam, and included members of the same family and a neighbour. Cases in cluster 2 were residents returning from different countries: three of them were on the same flight from Germany, one from Japan and the other from Spain. A comparison of our sequences to 41 other Vietnamese sequences on GISAID collected between January and April 2020 showed little if any grouping suggesting most were unique importation events and there was little, if any, mixing between the northern (this study) and southern parts of Vietnam (S1 Fig).

Mutations in SARS-CoV-2 genomes

Using CLC genomics workbench, we identified 285 mutations covering 67 variant types among the 44 SARS-CoV-2 genomes when compared to the reference genome, averaging 6.5 variants per genome. Sixty-one variants were single nucleotide substitutions, causing 36 non-synonymous and 25 synonymous amino acid changes (Table 4). The most ubiquitous modifications were C3037T, C14408T (P323L) and A23403G (D614G) occurring in 40/44 samples. Two other variants C241T and GGG to AAC at 28881–3 were detected in 39 and 33 sequences, respectively. These variants are key markers to define lineages B.1 (C241T, C3037T, A23403G) and B.1.1 (C241T, C3037T, A23403G, GGG28881..28883AAC), the major lineages in our study. The most common variants were detected in viral sequences amplified from patients with both mild disease and in the ICU.
Table 4

Mutations in SARS-CoV-2 genomes and their detection in the GISAID database: Nucleotide variants and amino acid changes.

LocationMutation typeReferenceAlleleAmino acid changeCoding regionMutation in databasen (%)
174SNVGA-1 (0.35)
241SNVCT-39 (13.68)
313SNVCT-1 (0.35)
370SNVGA-1 (0.35)
508–522DeletionTGGTCATGTTATGGT-Gly82_Val86delnsp1known2 (0.7)
516–518DeletionTTA-Met85delnsp1known1 (0.35)
1059SNVCTT85Insp2known4 (1.4)
1440SNVGAG212Dnsp2known1 (0.35)
1666SNVTC-2 (0.7)
2480SNVAGI559Vnsp2known1 (0.35)
2558SNVCTP585Snsp2known2 (0.7)
2891SNVGAA58Tnsp3known1 (0.35)
3037SNVCT-40 (14.04)
3370SNVTA-1 (0.35)
4002SNVCTT428Insp3known1 (0.35)
4655SNVCTR646Wnsp3known1 (0.35)
4733SNVCTL672Fnsp3novel1 (0.35)
4908SNVGAG730Dnsp3known1 (0.35)
5513SNVGA-1 (0.35)
5514DeletionT-Frameshiftnsp3novel1 (0.35)
6027SNVCTP1103Lnsp3known1 (0.35)
6276SNVAGK1186Rnsp3known1 (0.35)
6502SNVAG-1 (0.35)
8422SNVGAM1901Insp3known1 (0.35)
9172SNVTC-1 (0.35)
9983SNVGAD477Nnsp4known1 (0.35)
10054SNVGA-1 (0.35)
10097SNVGAG15Snsp5known1 (0.35)
10948SNVAG-1 (0.35)
11083SNVGTL37Fnsp6known3 (1.05)
11824SNVCT-3 (1.05)
13536SNVCT-1 (0.35)
13922SNVATD161Vnsp12known1 (0.35)
14184SNVCT-2 (0.7)
14408SNVCTP323Lnsp12known40 (14.04)
14452SNVGTV338Fnsp12known1 (0.35)
14793SNVCT-1 (0.35)
14805SNVCT-3 (1.05)
15720SNVCT-1 (0.35)
18021SNVGTR595Snsp13known1 (0.35)
18877SNVCT-1 (0.35)
19839SNVTC-6 (2.11)
20578SNVGTV320Lnsp15known1 (0.35)
21057SNVCT-2 (0.7)
21058SNVCTP134Snsp16known1 (0.35)
21077SNVCTT140Insp16known1 (0.35)
21724SNVGTL54FSknown2 (0.7)
22323SNVCTS254FSknown1 (0.35)
23403SNVAGD614GSknown40 (14.04)
23731SNVCT-1 (0.35)
25311SNVGTC1250FSknown1 (0.35)
25563SNVGTQ57HORF 3aknown5 (1.75)
26028SNVCT-1 (0.35)
26144SNVGTG251VORF 3aknown3 (1.05)
26256SNVCT-1 (0.35)
26530SNVAGD3GMknown1 (0.35)
26730SNVGTV70FMknown1 (0.35)
27635SNVCTS81LORF 7aknown1 (0.35)
27679SNVCTL96FORF 7aknown2 (0.7)
27874SNVCTT40IORF 7bknown1 (0.35)
27695–27700DeletionTTCTTA-L102_I103delORF 7aknown1 (0.35)
27792–27793DeletionTT-FrameshiftORF 7bnovel1 (0.35)
28881–28883MNVGGGAACR203K; G204RNknown33 (11.58)
28812SNVGTS180INknown1 (0.35)
28905SNVCTA211VNknown5 (1.75)
29122SNVACQ283HNnovel1 (0.35)
29546SNVCA-1 (0.35)
Total285

SNV: single nucleotide variant, MNV: Multiple nucleotide variant, nsp: non-structural protein, OFR: open reading frame.

SNV: single nucleotide variant, MNV: Multiple nucleotide variant, nsp: non-structural protein, OFR: open reading frame. There were three deletions of 15, six and three nucleotides, resulting in loss of five, two and one amino acids respectively, while two deletions of one or two nucleotides led to frameshifts. No insertions among the 44 genome sequences were detected. Two novel mutations were associated with frameshifts as mentioned previously, the others were non-synonymous changes, namely C4733T (L672F) and A29122C (Q283H). Novel mutations occurred in non-structural protein 3 (ORF 1a), open reading frame 7b and N coding regions on SARS-CoV-2 genome, and were detected in three different genomes (VNHN_0148, VNHN_0762, VNHN_1166). These mutations were not associated with differences in phenotype of illness, although numbers were small. There were 14 ORFs across the SARS-CoV-2 genome [4]. Mutations detected in our study often occurred in the nsp3 region (ORF1ab) with nine variant types, followed by N, S and nsp2 regions with four variant types (Table 4). The conservative coding regions without any modifications were E, ORF6, ORF8 and ORF10. Notably, four types of mutations detected in the Spike gene did not occur in the receptor binding domain (RBD: residue 319–541) that binds to the human receptor hACE2 for viral entry [3].

Discussion

NHTD is a front-line national hospital in Hanoi for isolation and treatment of laboratory-confirmed COVID-19 cases for the northern part of Vietnam. Forty-four SARS-CoV-2 RT-PCR patients with Ct values <30 had SARS-CoV-2 WGS performed with the aim of characterising lineages and identifying mutations in patients with different transmission sources in Vietnam. This study also describes the clinical and epidemiological features, treatment and outcomes of 44 COVID-19 patients. The ages, gender and origins of COVID-19 disease are consistent with a Vietnamese Ministry of Health report summarising the demographic and epidemiological data of 207 COVID-19 cases in Vietnam from January 23 to May 1 2020 [5]. Among our 44 patients, 41 had respiratory symptoms on admission. In a study in southern Vietnam conducted in a quarantine facility rather than a referral hospital, only 57% of patients reported symptoms [18]. This difference may be due to characteristics of patients varying between a national hospital and government quarantine centres, or due to sample selection: we examined patients with Ct <30 in the real-time RT PCR, thus possibly not representing all cases at the NHTD. Common symptoms such as dry cough and fever reported in our study are similar to earlier reports of COVID-19 in China [29, 30] and Vietnam [7, 9, 16]. However, most patients did not have underlying chronic diseases, and most (77%) had pneumonia with abnormalities on chest imaging, including one asymptomatic case. The median hospital stay was 25 days, which was longer than that of other reports from Vietnam of either 14–19 or 15–18 days [8, 13]. The SARS-CoV-2 genomes in our study are complete, high coverage sequences, accounting for more than 50% of available Vietnamese genomes of SARS-CoV-2 submitted to GISAID to date. Our phylogenetic analysis of 85 SARS-CoV-2 sequences from Vietnam collected up to April 15th 2020 submitted to GISAID showed almost all were B.1 lineage and sublineages. Most of our viral genomes belonged to the B.1.1 lineage, the dominant lineage in the UK as updated on May 19 2020 [31]. This supports the epidemiological data where 50% (15/30) of international travellers in this study had returned from UK. We identified two significant clusters of B.1.1 lineage, each containing three and five samples. Combined with epidemiological factors, the cluster of three samples originated from a local hotspot village whereas the cluster of five cases was from three different countries. There was a significant difference between viruses sampled in south and north Vietnam, mainly B.1 versus B.1.1 respectively (S1 Fig). In addition, viruses within each region in Vietnam did not cluster together but with sequences from other countries, suggesting multiple introduction events and limited local transmission. We characterised the three most prevalent variants detected in 40/44 completed genome sequences, namely C3037T (synonymous substitution) and two non-synonymous mutations C14408T (nsp12: P323L), A23403G (S: D614G). Another study analysed over 10,000 SARS-CoV-2 genomes from 68 countries from four databases showed an identical result, with over 6000 sequences carrying these three variants [32]. Interestingly, these three variants always occurred together, and sometimes they were accompanied by a synonymous mutation C241T (in 39 genomes) or multiple nucleotide substitutions GGG28881..28883AAC (N: R203K, G204R; in 33 genomes), suggesting linkage of mutations among these strains. The prevalence of these mutations was consistent with major lineages identified in this study. The linkage of four changes (C241T, C3037T, C14408T, A23403T) has been described in a comprehensive study of Spike protein variants especially related to the D614G transition [33]. The D614G mutation was first detected in a SARS-CoV-2 sequence from Germany in January 2020 [34]. Subsequently, haplotypes combining these four variants appeared in Italy in February 2020, then spread throughout Europe and currently account for 78% of worldwide sequences submitted to GISAID (May 29 2020), forming the G614 SARS-CoV-2 strain [33]. Of the SARS-CoV-2 genomes from the first two patients in Vietnam (one from Wuhan who transmitted to a second case), the D614G change was not found [9]. The B.1 lineage and its sublineages have formed the majority of sequences from Vietnam since March 2020 (S1 Fig) and harbour D614G. The current variants detected in our study together with other genome sequences indicate a similar trend of transition of D614 to G614 in Vietnam, which confirms that the main transmission source of COVID-19 here was from Europe rather than from China (where the progenitor D614 strains predominate). The study by Kyogama et al. [32] agreed with our finding that nsp3 protein (ORF1ab) most usually occurred as mis-sense variants. We did not find mutations in the E, ORF6, ORF8 and ORF10 coding regions. We identified other mutations on the Spike protein (L54F, S254F, C1250F), however these changes did not occur in the receptor binding domain. Four new unique variants were detected in three SARS-CoV-2 genomes in our study, including C4733T (nsp3: L672F), A29122C (N: Q283H), deletion TT 27792..27793 (OFR7b: Leu14 frameshift) and deletion T 5514 (nsp3: Val931 frameshift). However, we did not find a link between these new mutations and disease severity. This study provides an insight into the clinical features of COVID-19 and evolutionary trends of SARS-CoV-2 in Vietnam. Genomic surveillance combined with field epidemiology will remain crucial to track and trace transmission.

Phylogenetic analysis of SARS-CoV-2 genomes in Vietnam with taxa labels.

An alignment of 1056 SARS-CoV-2 sequences including 44 from this study and 41 other Vietnamese sequences was examined using a maximum likelihood approach. This corresponds to the same phylogeny shown in Fig 1, except it has been expanded to show individual taxa labels. Sequences have been coloured according to the provided key and SARS-CoV-2 lineages and bootstrap support values are shown for all branches. The scale represents the number of substitutions per site. (TIF) Click here for additional data file.

Clinical characteristics and treatment of critically ill COVID-19 patients.

(DOCX) Click here for additional data file.

Accession number of Vietnamese SARS-CoV-2 genome sequences submitted to GISAID.

(DOCX) Click here for additional data file.

Lineage of SARS-CoV-2 strains collected at the National Hospital for Tropical Diseases.

(DOCX) Click here for additional data file.
  24 in total

1.  RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies.

Authors:  Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2014-01-21       Impact factor: 6.937

2.  The Natural History and Transmission Potential of Asymptomatic Severe Acute Respiratory Syndrome Coronavirus 2 Infection.

Authors:  Nguyen Van Vinh Chau; Vo Thanh Lam; Nguyen Thanh Dung; Lam Minh Yen; Ngo Ngoc Quang Minh; Le Manh Hung; Nghiem My Ngoc; Nguyen Tri Dung; Dinh Nguyen Huy Man; Lam Anh Nguyet; Le Thanh Hoang Nhat; Le Nguyen Truc Nhu; Nguyen Thi Han Ny; Nguyen Thi Thu Hong; Evelyne Kestelyn; Nguyen Thi Phuong Dung; Tran Chanh Xuan; Tran Tinh Hien; Nguyen Thanh Phong; Tran Nguyen Hoang Tu; Ronald B Geskus; Tran Tan Thanh; Nguyen Thanh Truong; Nguyen Tan Binh; Tang Chi Thuong; Guy Thwaites; Le Van Tan
Journal:  Clin Infect Dis       Date:  2020-12-17       Impact factor: 9.079

3.  COVID-19 in Vietnam: A lesson of pre-preparation.

Authors:  Ha-Linh Quach; Ngoc-Anh Hoang
Journal:  J Clin Virol       Date:  2020-04-22       Impact factor: 3.168

4.  The largest epicenter of the coronavirus outbreak in Vietnam.

Authors:  Trang H D Nguyen; Danh C Vu
Journal:  Infect Control Hosp Epidemiol       Date:  2020-04-13       Impact factor: 3.254

5.  Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study.

Authors:  Nanshan Chen; Min Zhou; Xuan Dong; Jieming Qu; Fengyun Gong; Yang Han; Yang Qiu; Jingli Wang; Ying Liu; Yuan Wei; Jia'an Xia; Ting Yu; Xinxin Zhang; Li Zhang
Journal:  Lancet       Date:  2020-01-30       Impact factor: 79.321

6.  Describing the pattern of the COVID-19 epidemic in Vietnam.

Authors:  Van Minh Hoang; Hong Hanh Hoang; Quynh Long Khuong; Ngoc Quang La; Thi Tuyet Hanh Tran
Journal:  Glob Health Action       Date:  2020-12-31       Impact factor: 2.640

7.  Severe Acute Respiratory Syndrome Coronavirus 2 Shedding by Travelers, Vietnam, 2020.

Authors:  Thi Quynh Mai Le; Taichiro Takemura; Meng Ling Moi; Takeshi Nabeshima; Le Khanh Hang Nguyen; Vu Mai Phuong Hoang; Thi Hong Trang Ung; Thi Thanh Le; Vu Son Nguyen; Hong Quynh Anh Pham; Tran Nhu Duong; Hai Tuan Nguyen; Duy Nghia Ngu; Cong Khanh Nguyen; Kouichi Morita; Futoshi Hasebe; Duc Anh Dang
Journal:  Emerg Infect Dis       Date:  2020-06-21       Impact factor: 6.883

8.  MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization.

Authors:  Kazutaka Katoh; John Rozewicki; Kazunori D Yamada
Journal:  Brief Bioinform       Date:  2019-07-19       Impact factor: 11.622

9.  An emergent clade of SARS-CoV-2 linked to returned travellers from Iran.

Authors:  John-Sebastian Eden; Rebecca Rockett; Ian Carter; Hossinur Rahman; Joep de Ligt; James Hadfield; Matthew Storey; Xiaoyun Ren; Rachel Tulloch; Kerri Basile; Jessica Wells; Roy Byun; Nicky Gilroy; Matthew V O'Sullivan; Vitali Sintchenko; Sharon C Chen; Susan Maddocks; Tania C Sorrell; Edward C Holmes; Dominic E Dwyer; Jen Kok
Journal:  Virus Evol       Date:  2020-04-10

10.  SARS-CoV-2 and co-infections detection in nasopharyngeal throat swabs of COVID-19 patients by metagenomics.

Authors:  Le Van Tan; Nguyen Thi Thu Hong; Nghiem My Ngoc; Tran Tan Thanh; Vo Thanh Lam; Lam Anh Nguyet; Le Nguyen Truc Nhu; Nguyenn Thi Ha Ny; Ngo Ngoc Quang Minh; Dinh Nguyen Huy Man; Vu Thi Ty Hang; Phan Nguyen Quoc Khanh; Tran Chanh Xuan; Nguyen Thanh Phong; Tran Nguyen Hoang Tu; Tran Tinh Hien; Le Manh Hung; Nguyen Thanh Truong; Lamh Min Yen; Nguyen Thanh Dung; Guy Thwaites; Nguyen Van Vinh Chau
Journal:  J Infect       Date:  2020-06-17       Impact factor: 6.072

View more
  11 in total

1.  Post-traumatic stress disorder, anxiety, depression and related factors among COVID-19 patients during the fourth wave of the pandemic in Vietnam.

Authors:  Hoang Bac Nguyen; Thi Hong Minh Nguyen; Thi Hong Nhan Vo; Thi Cam Nhung Vo; Duc Nguyet Quynh Nguyen; Huu-Thinh Nguyen; Tuan-Ngan Tang; Thi-Hiep Nguyen; Van Trang Do; Quang Binh Truong
Journal:  Int Health       Date:  2022-06-13       Impact factor: 3.131

2.  Identifying COVID-19 Severity-Related SARS-CoV-2 Mutation Using a Machine Learning Method.

Authors:  Feiming Huang; Lei Chen; Wei Guo; Xianchao Zhou; Kaiyan Feng; Tao Huang; Yudong Cai
Journal:  Life (Basel)       Date:  2022-05-28

3.  Investigation of target sequencing of SARS-CoV-2 and immunogenic GWAS profiling in host cells of COVID-19 in Vietnam.

Authors:  Tham H Hoang; Giang M Vu; Mai H Tran; Trang T H Tran; Quang D Le; Khanh V Tran; Tue T Nguyen; Lan T N Nguyen; Thinh H Tran; Van T Ta; Nam S Vo
Journal:  BMC Infect Dis       Date:  2022-06-19       Impact factor: 3.667

4.  Full-length genome characterization and phylogenetic analysis of SARS-CoV-2 virus strains from Yogyakarta and Central Java, Indonesia.

Authors:  Hendra Wibawa; Mohamad Saifudin Hakim; Edwin Widyanto Daniwijaya; Ludhang Pradipta Rizki; Endah Supriyati; Dwi Aris Agung Nugrahaningsih; Kristy Iskandar; Nungki Anggorowati; Alvin Santoso Kalim; Dyah Ayu Puspitarani; Kemala Athollah; Eggi Arguni; Titik Nuryastuti; Tri Wibawa
Journal:  PeerJ       Date:  2020-12-21       Impact factor: 2.984

5.  A systematic review on papers that study on Single Nucleotide Polymorphism that affects coronavirus 2019 severity.

Authors:  Siyeon Suh; Sol Lee; Ho Gym; Sanghyuk Yoon; Seunghwan Park; Jihi Cha; Do-Hyung Kwon; YunSu Yang; Sun Ha Jee
Journal:  BMC Infect Dis       Date:  2022-01-12       Impact factor: 3.090

6.  Genomic diversity of SARS-CoV-2 in Malaysia.

Authors:  Noorliza Mohamad Noordin; Joon Liang Tan; Chee Kheong Chong; Yu Kie Chem; Norazimah Tajudin; Rehan Shuhada Abu Bakar; Selvanesan Sengol; Hannah Yik Phing Phoon; Nurul Aina Murni Che Azid; W Nur Afiza W Mohd Arifin; Zirwatul Adilah Aziz; Hani Hussin; Nurul Syahida Ibrahim; Aziyati Omar; Ushananthiny Ravi; Kamal Hisham Kamarul Zaman; Mohd Asri Yamin; Yun Fong Ngeow
Journal:  PeerJ       Date:  2021-11-03       Impact factor: 2.984

Review 7.  COVID-19 Infection in Children and Infants: Current Status on Therapies and Vaccines.

Authors:  Giuseppina Malcangi; Alessio Danilo Inchingolo; Angelo Michele Inchingolo; Fabio Piras; Vito Settanni; Grazia Garofoli; Giulia Palmieri; Sabino Ceci; Assunta Patano; Antonio Mancini; Luigi Vimercati; Damiano Nemore; Arnaldo Scardapane; Biagio Rapone; Alexandra Semjonova; Maria Teresa D'Oria; Luigi Macchia; Ioana Roxana Bordea; Giovanni Migliore; Antonio Scarano; Felice Lorusso; Gianluca Martino Tartaglia; Delia Giovanniello; Ludovica Nucci; Nicola Maggialetti; Antonio Parisi; Marina Di Domenico; Nicola Brienza; Silvio Tafuri; Pasquale Stefanizzi; Luigi Curatoli; Alberto Corriero; Maria Contaldo; Francesco Inchingolo; Gianna Dipalma
Journal:  Children (Basel)       Date:  2022-02-12

8.  Capacity and use of diagnostics and treatment for patients with severe acute respiratory infections in the pre-COVID-19 era in district and provincial hospitals in Viet Nam.

Authors:  Vu Quoc Dat; Kim Bao Giang; Hieu Quang Vu; Satoko Otsu
Journal:  Western Pac Surveill Response J       Date:  2021-11-30

Review 9.  COVID-19 in Southeast Asia: current status and perspectives.

Authors:  Dinh-Toi Chu; Suong-Mai Vu Ngoc; Hue Vu Thi; Yen-Vy Nguyen Thi; Thuy-Tien Ho; Van-Thuan Hoang; Vijai Singh; Jaffar A Al-Tawfiq
Journal:  Bioengineered       Date:  2022-02       Impact factor: 3.269

10.  Comparative study of predicted miRNA between Indonesia and China (Wuhan) SARS-CoV-2: a bioinformatics analysis.

Authors:  Agus Rahmadi; Ismaily Fasyah; Digdo Sudigyo; Arif Budiarto; Bharuno Mahesworo; Alam Ahmad Hidayat; Bens Pardamean
Journal:  Genes Genomics       Date:  2021-06-21       Impact factor: 1.839

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.