| Literature DB >> 32950697 |
Carlos Franco-Muñoz1, Diego A Álvarez-Díaz2, Katherine Laiton-Donato2, Magdalena Wiesner3, Patricia Escandón3, José A Usme-Ciro4, Nicolás D Franco-Sierra5, Astrid C Flórez-Sánchez6, Sergio Gómez-Rangel7, Luz D Rodríguez-Calderon7, Juliana Barbosa-Ramirez7, Erika Ospitia-Baez7, Diana M Walteros8, Martha L Ospina-Martinez9, Marcela Mercado-Reyes10.
Abstract
SARS-CoV-2 is a new member of the genus Betacoronavirus, responsible for the COVID-19 pandemic. The virus crossed the species barrier and established in the human population taking advantage of the spike protein high affinity for the ACE receptor to infect the lower respiratory tract. The Nucleocapsid (N) and Spike (S) are highly immunogenic structural proteins and most commercial COVID-19 diagnostic assays target these proteins. In an unpredictable epidemic, it is essential to know about their genetic variability. The objective of this study was to describe the substitution frequency of the S and N proteins of SARS-CoV-2 in South America. A total of 504 amino acid and nucleotide sequences of the S and N proteins of SARS-CoV-2 from seven South American countries (Argentina, Brazil, Chile, Ecuador, Peru, Uruguay, and Colombia), reported as of June 3, and corresponding to samples collected between March and April 2020, were compared through substitution matrices using the Muscle algorithm. Forty-three sequences from 13 Colombian departments were obtained in this study using the Oxford Nanopore and Illumina MiSeq technologies, following the amplicon-based ARTIC network protocol. The substitutions D614G in S and R203K/G204R in N were the most frequent in South America, observed in 83% and 34% of the sequences respectively. Strikingly, genomes with the conserved position D614 were almost completely replaced by genomes with the G614 substitution between March to April 2020. A similar replacement pattern was observed with R203K/G204R although more marked in Chile, Argentina and Brazil, suggesting similar introduction history and/or control strategies of SARS-CoV-2 in these countries. It is necessary to continue with the genomic surveillance of S and N proteins during the SARS-CoV-2 pandemic as this information can be useful for developing vaccines, therapeutics and diagnostic tests.Entities:
Keywords: Non-synonymous substitutions; Nucleocapsid; SARS-CoV-2; South America; Spike
Year: 2020 PMID: 32950697 PMCID: PMC7497549 DOI: 10.1016/j.meegid.2020.104557
Source DB: PubMed Journal: Infect Genet Evol ISSN: 1567-1348 Impact factor: 3.342
Nucleotide substitutions and amino acid changes in the Spike and Nucleocapsid proteins of 43 SARS-CoV-2 genomes from Colombia generated in this study. Sequences from other countries of South America were retrieved from GISAID database. Multiple alignment was made using the Muscle algorithm in MEGA X software. Location of the substitutions was estimated based on the reference genome NC_045512.2. Left panel: nucleotides; right panel: amino acids. The uncovered nucleotide and amino acids during complete genome sequencing are represented by an interrogate mark (?). Nucleotide and amino acid positions are indicated in the number under each gene.
Fig. 1Non-synonymous substitutions in the Spike and Nucleocapsid proteins of SARS-CoV-2 in South America. A. Non-synonymous substitution sites in the S protein. Blue bars represent the genomes with conserved positions, red bars represent the genomes with substitution at the indicated site (at least one genome per site). B. Non-synonymous substitutions in the N. Blue bars represent the genome with conserved positions, red bars represent the genomes with substitution at the indicated site (at least one genome). Green bars represent the number of genomes with undetermined amino acids at the indicated site. Black arrows show substitutions found in the 43 Colombian genomes generated in this study. The location of the substitutions was estimated based on the reference genome NC_045512.2. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 2Spatiotemporal distribution of the D614G substitution in the Spike protein of SARS-CoV-2 in South America. A. Distribution of D614/G614 substitutions in SARS-CoV-2 genomes reported as of Jun 3th 2020 in South American countries B. Spatiotemporal distribution of D614/G614 substitutions in SARS-CoV-2 genomes sequenced between March and April, 2020 in South America, according with the sample collection date.
Fig. 3Spatiotemporal distribution of the most frequent substitutions in the Nucleocapsid protein of SARS-CoV-2 in South America. A. Distribution of R203/G204 and K203/R204 substitutions in SARS-CoV-2 genomes reported as of Jun 3th, 2020 in South American countries. B. Spatiotemporal of R203/G204 and K203/R204 substitutions in SARS-CoV-2 genomes sequenced between March and April, 2020 in South America, according with the sample collection date. C. Distribution of I292/T292 substitutions of nucleocapsid protein in SARS-CoV-2 genomes reported as of Jun 3th, 2020 in South American countries. D. Distribution of I292 and T292 substitutions in SARS-CoV-2 genomes sequenced between March and April, 2020 in South America, according with the sample collection date.