Literature DB >> 33181329

Analysis of the potential impact of genomic variants in global SARS-CoV-2 genomes on molecular diagnostic assays.

Abhinav Jain1, Mercy Rophina1, Saurabh Mahajan2, Bhavya Balaji Krishnan3, Manasa Sharma4, Sreya Mandal2, Teresa Fernandez2, Sumayra Sultanji2, Bani Jolly1, Samatha Mathew1, Sridhar Sivasubbu1, Vinod Scaria5.   

Abstract

An epidemic of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) causing coronavirus diseases (C0VID-19) initially reported in Wuhan, China has rapidly emerged into a global pandemic affecting millions of people worldwide. Molecular detection of SARS-CoV-2 using reverse transcription polymerase chain reaction (RT-PCR) forms the mainstay in screening, diagnosis and epidemiology of the disease. Since the virus evolves by accumulating base substitutions, mutations in the viral genome could possibly affect the accuracy of RT-PCR-based detection assays. The recent availability of genomes of SARS-CoV-2 isolates motivated us to assess the presence and potential impact of variations in target sites of the oligonucleotide primers and probes used in molecular diagnosis. We catalogued a total of 132 primer or probe sequences from literature and data available in the public domain. Our analysis revealed that a total of 5862 unique genetic variants mapped to at least one of the 132 primer or probe binding sites in the genome. A total of 29 unique variants were present in ≥ 1% of genomes from at least one of the continents (Asia, Africa, Australia, Europe, North America, and South America) that mapped to 36 unique primers or probes binding sites. Similarly, a total of 27 primer or probe binding sites had cumulative variants frequency of ≥ 1% in the global SARS-CoV-2 genomes. These included primers or probes sites which are used worldwide for molecular diagnosis as well as approved by national and international agencies. We also found 286 SARS-CoV-2 genomic regions with low variability at a continuous stretch of ≥ 20bps that could be potentially used for primer designing. This highlights the need for sequencing genomes of emerging pathogens to enable evidence-based policies for development and approval of diagnostics.
Copyright © 2020 The Authors. Published by Elsevier Ltd.. All rights reserved.

Entities:  

Keywords:  COVID-19; Genomes; Gibbs free energy; Reverse transcription polymerase chain reaction; SARS-CoV-2; Variations

Mesh:

Year:  2020        PMID: 33181329      PMCID: PMC7834429          DOI: 10.1016/j.ijid.2020.10.086

Source DB:  PubMed          Journal:  Int J Infect Dis        ISSN: 1201-9712            Impact factor:   3.623


SARS-CoV-2 variants impact RT-PCR efficiency in detection. A total of 29 global SARS-CoV-2 genetic variants had a frequency ≥ 1%. The thermodynamic stability of the virus-primers complex gets perturbed. A number of recommended primer or probe sequences had high variant frequency. Initially reported from a city in China, the coronavirus disease 2019 (COVID-19) has now rapidly emerged as a global pandemic. Reverse transcription polymerase chain reaction (RT-PCR) based assays have been the mainstay for the diagnosis and screening of COVID-19 due to their high sensitivity and specificity (Shen et al. 2020). These assays utilize oligonucleotide primers and probes specific to the viral nucleic acid. The SARS-CoV-2 has been continuously evolving and has an estimated substitution rate of 1.19–1.31 × 10−3 per site per year (Li et al. 2020). Recent reports that suggest genetic variation in viruses at the primers or probes binding site could decrease the sensitivity of RT-PCR based assays (Yang et al. 2014). Motivated by the availability of a large number of genomes of SARS-CoV-2 isolates globally, we attempted to understand the genomic variants and their potential impact on molecular diagnostic assays. We analysed the genome sequences of SARS-CoV-2 isolates deposited in GISAID (Shu and McCauley 2017) as on 26th September 2020. Only complete genome sequences that had ≥99% alignment with the Wuhan-Hu-1 reference genome (NC_045512.2) (Wu et al. 2020) and <5% degenerate bases were considered for the analysis. Genome sequences having clustered mutations and higher than expected divergence were also excluded from the analysis. The individual genomes were re-aligned to the Wuhan-Hu-1 reference genome using EMBOSS needle (Rice et al. 2000) and the pairwise alignments were parsed to identify variants using bespoke scripts. The primer/probe sequences were compiled using extensive literature searches as well as from public databases and were mapped to the reference genome using BLAST (Altschul et al. 1990). The SARS-CoV-2 genomic variant coordinates were overlapped with the primer/probe binding sites. Melting temperature (Tm) and Gibbs free energy (ΔG°37) at standard condition were calculated for primer or probe sequences. We also evaluated the internal and terminal mismatches that could have an impact on the thermodynamic stability of the nucleic acid secondary structure as well as on the Tm Supplementary Method 1. We have also identified regions in the SARS-CoV-2 genome with low variability. We have considered variants below 95th percentile of the frequency and identified continuous stretches of ≥ 20bps for designing primers. The choice of 20 bps is guided by the standard length of the primer/probe sequences. A total of 45,830 high quality genome sequences which comprise 4779 sequences form Asia, 25,091 sequences from Europe, 859 from Africa, 12,949 sequences from North America, 791 sequences from South America and 1361 from Australia were used in the analysis. Our analysis revealed a total of 88,880 unique single nucleotide variants (SNVs) across the genome. We compiled a total of 132 primers or probe sequences Supplementary Data 1. A total of 5862 unique genetic variants mapped to 132 primers or probes binding sites in the SARS-CoV-2 genome. Out of these, a total of 29 unique variants had allele frequency ≥ 1% in at least one of the six continents from where the SARS-CoV-2 genomes were isolated Table 1 and Figure 1 . We have also observed potential differences in the ΔG°37 and Tm, that affect the thermodynamic stability of secondary structure and the annealing of the primers and probes to the viral cDNA/RNA respectively Table 1. Of significant interest, three variants with over 30% frequency each in genomes mapped to the primer 2019-nCoV-NFP GGGGAACTTCTCCTGCTAGAAT that targets N gene which is a part of the China Centers for Disease Control and Prevention (CDC) protocol (WHO in house assay, 2020). A cumulative variant frequency of 93.5% was found in 2019-nCoV-NFP binding site. Variants with >1% frequency were also found in primer / probes encompassing S, M, ORF1ab, and ORF3a genes Table 1. A total of 27 primers and probes sequences had cumulative variant frequency >1% of which 11 were approved by the national regulatory bodies mainly by the Centers for Disease Control and Prevention (CDC) and World Health Organization (WHO) and has been widely used across the globe Supplementary Data 2. Our analysis also suggests 286 genomic regions/sites with variants frequency below 95th percentile (corresponding to variant frequency of 1.7 × 10−4) Supplementary Figure 1 and Supplementary Data 3.
Table 1

Summary of Primer and Probe sequences and genomic variants analysis.

The variant frequency, primers/probes frequency, Gibbs free energy (ΔG), and melting temperature (Tm) for reference and alternate allele in the Indian SARS-CoV-2 isolates. Only primers/probes with a frequency of more than 1% in any of the six continents are included in this Table.

Tm-Melting Temperature Temperature, ΔG- Gibbs Free Energy, Ref- Reference, Alt- Alternate, No.- Number, Afr- Africa, Aus- Australia, Eur- Europe, NA-North America,SA- South America.

Figure 1

Genome Wide distribution (A) and frequency (B) of the 29 genetic variants with allele Frequency ≥ 1% in at least one of 6 continents.

Note: Colour required in Print.

Summary of Primer and Probe sequences and genomic variants analysis. The variant frequency, primers/probes frequency, Gibbs free energy (ΔG), and melting temperature (Tm) for reference and alternate allele in the Indian SARS-CoV-2 isolates. Only primers/probes with a frequency of more than 1% in any of the six continents are included in this Table. Tm-Melting Temperature Temperature, ΔG- Gibbs Free Energy, Ref- Reference, Alt- Alternate, No.- Number, Afr- Africa, Aus- Australia, Eur- Europe, NA-North America,SA- South America. Genome Wide distribution (A) and frequency (B) of the 29 genetic variants with allele Frequency ≥ 1% in at least one of 6 continents. Note: Colour required in Print. Our analysis suggests that genome sequencing of isolates in an epidemic could provide useful insights into assessing the diagnostic efficacies as also suggested by previous authors (Khan and Cheung, 2020). We surmise that this could possibly drive policies on evaluation and approvals of the assays for screening and diagnosis. The study also highlights the need for rapid and wide-spread sharing of genomic data of pathogens as well as molecular probe information through public archives during pandemics.

Author contributions

MR performed the genome analysis and variant calls. BJ performed the quality assessment of the genome dataset. AJ and SM1 co-ordinated the compendium of primers and probes with help of Bhavya Balaji Krishnan, Manasa Sharma, Sreya Mandal, Teresa Fernandez and Sumayra Sultanji. SM2 contributed to mapping the primers to the genomic loci. AJ performed the analysis of variants mapping to the probe-target sites and was assisted by MR. VS and SS provided the conceptual overview to the analysis. VS, MR and AJ wrote the manuscript, the content and analysis which was read and agreed upon by all authors.

Ethical approval

The study does not require any ethical approval

Declaration of interests

None
  8 in total

1.  EMBOSS: the European Molecular Biology Open Software Suite.

Authors:  P Rice; I Longden; A Bleasby
Journal:  Trends Genet       Date:  2000-06       Impact factor: 11.639

2.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

3.  Newly emerging mutations in the matrix genes of the human influenza A(H1N1)pdm09 and A(H3N2) viruses reduce the detection sensitivity of real-time reverse transcription-PCR.

Authors:  Ji-Rong Yang; Chuan-Yi Kuo; Hsiang-Yi Huang; Fu-Ting Wu; Yi-Lung Huang; Chieh-Yu Cheng; Yu-Ting Su; Feng-Yee Chang; Ho-Sheng Wu; Ming-Tsan Liu
Journal:  J Clin Microbiol       Date:  2013-10-23       Impact factor: 5.948

4.  GISAID: Global initiative on sharing all influenza data - from vision to reality.

Authors:  Yuelong Shu; John McCauley
Journal:  Euro Surveill       Date:  2017-03-30

5.  Evolutionary history, potential intermediate animal host, and cross-species analyses of SARS-CoV-2.

Authors:  Xingguang Li; Junjie Zai; Qiang Zhao; Qing Nie; Yi Li; Brian T Foley; Antoine Chaillon
Journal:  J Med Virol       Date:  2020-03-11       Impact factor: 2.327

6.  A new coronavirus associated with human respiratory disease in China.

Authors:  Fan Wu; Su Zhao; Bin Yu; Yan-Mei Chen; Wen Wang; Zhi-Gang Song; Yi Hu; Zhao-Wu Tao; Jun-Hua Tian; Yuan-Yuan Pei; Ming-Li Yuan; Yu-Ling Zhang; Fa-Hui Dai; Yi Liu; Qi-Min Wang; Jiao-Jiao Zheng; Lin Xu; Edward C Holmes; Yong-Zhen Zhang
Journal:  Nature       Date:  2020-02-03       Impact factor: 49.962

Review 7.  Recent advances and perspectives of nucleic acid detection for coronavirus.

Authors:  Minzhe Shen; Ying Zhou; Jiawei Ye; Abdu Ahmed Abdullah Al-Maskri; Yu Kang; Su Zeng; Sheng Cai
Journal:  J Pharm Anal       Date:  2020-03-01
  8 in total
  12 in total

1.  Initial Insights Into the Genetic Epidemiology of SARS-CoV-2 Isolates From Kerala Suggest Local Spread From Limited Introductions.

Authors:  Chandni Radhakrishnan; Mohit Kumar Divakar; Abhinav Jain; Prasanth Viswanathan; Rahul C Bhoyar; Bani Jolly; Mohamed Imran; Disha Sharma; Mercy Rophina; Gyan Ranjan; Paras Sehgal; Beena Philomina Jose; Rajendran Vadukkoot Raman; Thulaseedharan Nallaveettil Kesavan; Kalpana George; Sheela Mathew; Jayesh Kumar Poovullathil; Sajeeth Kumar Keeriyatt Govindan; Priyanka Raveendranadhan Nair; Shameer Vadekkandiyil; Vineeth Gladson; Midhun Mohan; Fairoz Cheriyalingal Parambath; Mohit Mangla; Afra Shamnath; Sridhar Sivasubbu; Vinod Scaria
Journal:  Front Genet       Date:  2021-03-17       Impact factor: 4.599

Review 2.  Characterization of SARS-CoV-2 different variants and related morbidity and mortality: a systematic review.

Authors:  SeyedAhmad SeyedAlinaghi; Pegah Mirzapour; Omid Dadras; Zahra Pashaei; Amirali Karimi; Mehrzad MohsseniPour; Mahdi Soleymanzadeh; Alireza Barzegary; Amir Masoud Afsahi; Farzin Vahedi; Ahmadreza Shamsabadi; Farzane Behnezhad; Solmaz Saeidi; Esmaeil Mehraeen
Journal:  Eur J Med Res       Date:  2021-06-08       Impact factor: 2.175

3.  Deepening of In Silico Evaluation of SARS-CoV-2 Detection RT-qPCR Assays in the Context of New Variants.

Authors:  Mathieu Gand; Kevin Vanneste; Isabelle Thomas; Steven Van Gucht; Arnaud Capron; Philippe Herman; Nancy H C Roosens; Sigrid C J De Keersmaecker
Journal:  Genes (Basel)       Date:  2021-04-13       Impact factor: 4.096

4.  Setting-Up a Rapid SARS-CoV-2 Genome Assessment by Next-Generation Sequencing in an Academic Hospital Center (LPCE, Louis Pasteur Hospital, Nice, France).

Authors:  Paul Hofman; Olivier Bordone; Emmanuel Chamorey; Jonathan Benzaquen; Renaud Schiappa; Virginie Lespinet-Fabre; Elisabeth Lanteri; Patrick Brest; Baharia Mograbi; Charlotte Maniel; Virginie Tanga; Maryline Allegra; Myriam Salah; Julien Fayada; Jacques Boutros; Sylvie Leroy; Simon Heeke; Véronique Hofman; Charles-Hugo Marquette; Marius Ilié
Journal:  Front Med (Lausanne)       Date:  2022-01-11

Review 5.  Role of genomics in combating COVID-19 pandemic.

Authors:  K A Saravanan; Manjit Panigrahi; Harshit Kumar; Divya Rajawat; Sonali Sonejita Nayak; Bharat Bhushan; Triveni Dutt
Journal:  Gene       Date:  2022-03-04       Impact factor: 3.688

6.  Optimization of the SARS-CoV-2 ARTIC Network V4 Primers and Whole Genome Sequencing Protocol.

Authors:  Arnold W Lambisia; Khadija S Mohammed; Timothy O Makori; Leonard Ndwiga; Maureen W Mburu; John M Morobe; Edidah O Moraa; Jennifer Musyoki; Nickson Murunga; Jane N Mwangi; D James Nokes; Charles N Agoti; Lynette Isabella Ochola-Oyier; George Githinji
Journal:  Front Med (Lausanne)       Date:  2022-02-17

Review 7.  SARS-CoV-2 Diagnostics Based on Nucleic Acids Amplification: From Fundamental Concepts to Applications and Beyond.

Authors:  João M Vindeirinho; Eva Pinho; Nuno F Azevedo; Carina Almeida
Journal:  Front Cell Infect Microbiol       Date:  2022-03-23       Impact factor: 5.293

8.  A Two-Time Point Analysis of Gut Microbiota in the General Population of Buenos Aires and Its Variation Due to Preventive and Compulsory Social Isolation During the COVID-19 Pandemic.

Authors:  Pablo Aguilera; María Florencia Mascardi; Fiorella Sabrina Belforte; Ayelén Daiana Rosso; Sofía Quesada; Ignacio Llovet; Gregorio Iraola; Julieta Trinks; Alberto Penas-Steinhardt
Journal:  Front Microbiol       Date:  2022-03-24       Impact factor: 5.640

9.  Profiling RT-LAMP tolerance of sequence variation for SARS-CoV-2 RNA detection.

Authors:  Esta Tamanaha; Yinhua Zhang; Nathan A Tanner
Journal:  PLoS One       Date:  2022-03-24       Impact factor: 3.240

Review 10.  The Spike of SARS-CoV-2: Uniqueness and Applications.

Authors:  Ranjith Kumavath; Debmalya Barh; Bruno Silva Andrade; Madangchanok Imchen; Flavia Figueira Aburjaile; Athira Ch; Diego Lucas Neres Rodrigues; Sandeep Tiwari; Khalid J Alzahrani; Aristóteles Góes-Neto; Marianna E Weener; Preetam Ghosh; Vasco Azevedo
Journal:  Front Immunol       Date:  2021-07-08       Impact factor: 7.561

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.