Literature DB >> 33505425

Deciphering the Subtype Differentiation History of SARS-CoV-2 Based on a New Breadth-First Searching Optimized Alignment Method Over a Global Data Set of 24,768 Sequences.

Qianyu Lin1, Yunchuanxiang Huang1, Ziyi Jiang2, Feng Wu2, Lan Ma1,2,3.   

Abstract

SARS-CoV-2 has caused a worldwide pandemic. Existing research on coronavirus mutations is based on small data sets, and multiple sequence alignment using a global-scale data set has yet to be conducted. Statistical analysis of integral mutations and global spread are necessary and could help improve primer design for nucleic acid diagnosis and vaccine development. Here, we optimized multiple sequence alignment using a conserved sequence search algorithm to align 24,768 sequences from the GISAID data set. A phylogenetic tree was constructed using the maximum likelihood (ML) method. Coronavirus subtypes were analyzed via t-SNE clustering. We performed haplotype network analysis and t-SNE clustering to analyze the coronavirus origin and spread. Overall, we identified 33 sense, 17 nonsense, 79 amino acid loss, and 4 amino acid insertion mutations in full-length open reading frames. Phylogenetic trees were successfully constructed and samples clustered into subtypes. The COVID-19 pandemic differed among countries and continents. Samples from the United States and western Europe were more diverse, and those from China and Asia mainly contained specific subtypes. Clades G/GH/GR are more likely to be the origin clades of SARS-CoV-2 compared with clades S/L/V. Conserved sequence searches can be used to segment long sequences, making large-scale multisequence alignment possible, facilitating more comprehensive gene mutation analysis. Mutation analysis of the SARS-CoV-2 can inform primer design for nucleic acid diagnosis to improve virus detection efficiency. In addition, research into the characteristics of viral spread and relationships among geographic regions can help formulate health policies and reduce the increase of imported cases.
Copyright © 2021 Lin, Huang, Jiang, Wu and Ma.

Entities:  

Keywords:  SARS-CoV-2; haplotype network analysis; multiple sequence alignment; phylogenetic tree; t-SNE

Year:  2021        PMID: 33505425      PMCID: PMC7831388          DOI: 10.3389/fgene.2020.591833

Source DB:  PubMed          Journal:  Front Genet        ISSN: 1664-8021            Impact factor:   4.599


  22 in total

1.  A Multi-Targeting, Nucleoside-Modified mRNA Influenza Virus Vaccine Provides Broad Protection in Mice.

Authors:  Alec W Freyn; Jamile Ramos da Silva; Victoria C Rosado; Carly M Bliss; Matthew Pine; Barbara L Mui; Ying K Tam; Thomas D Madden; Luís Carlos de Souza Ferreira; Drew Weissman; Florian Krammer; Lynda Coughlan; Peter Palese; Norbert Pardi; Raffael Nachbagauer
Journal:  Mol Ther       Date:  2020-04-19       Impact factor: 11.454

2.  Data, disease and diplomacy: GISAID's innovative contribution to global health.

Authors:  Stefan Elbe; Gemma Buckland-Merrett
Journal:  Glob Chall       Date:  2017-01-10

3.  Interactive Tree Of Life (iTOL) v4: recent updates and new developments.

Authors:  Ivica Letunic; Peer Bork
Journal:  Nucleic Acids Res       Date:  2019-07-02       Impact factor: 16.971

Review 4.  From SARS-CoV to SARS-CoV-2: safety and broad-spectrum are important for coronavirus vaccine development.

Authors:  Cuiqing Ma; Shan Su; Jiachao Wang; Lin Wei; Lanying Du; Shibo Jiang
Journal:  Microbes Infect       Date:  2020-05-11       Impact factor: 2.700

5.  RAxML-NG: a fast, scalable and user-friendly tool for maximum likelihood phylogenetic inference.

Authors:  Alexey M Kozlov; Diego Darriba; Tomáš Flouri; Benoit Morel; Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2019-11-01       Impact factor: 6.937

6.  MUSCLE: a multiple sequence alignment method with reduced time and space complexity.

Authors:  Robert C Edgar
Journal:  BMC Bioinformatics       Date:  2004-08-19       Impact factor: 3.169

7.  Early phylogenetic estimate of the effective reproduction number of SARS-CoV-2.

Authors:  Alessia Lai; Annalisa Bergna; Carla Acciarri; Massimo Galli; Gianguglielmo Zehender
Journal:  J Med Virol       Date:  2020-03-03       Impact factor: 2.327

8.  Projecting the transmission dynamics of SARS-CoV-2 through the postpandemic period.

Authors:  Stephen M Kissler; Christine Tedijanto; Yonatan H Grad; Marc Lipsitch; Edward Goldstein
Journal:  Science       Date:  2020-04-14       Impact factor: 47.728

9.  Clinical progression of patients with COVID-19 in Shanghai, China.

Authors:  Jun Chen; Tangkai Qi; Li Liu; Yun Ling; Zhiping Qian; Tao Li; Feng Li; Qingnian Xu; Yuyi Zhang; Shuibao Xu; Zhigang Song; Yigang Zeng; Yinzhong Shen; Yuxin Shi; Tongyu Zhu; Hongzhou Lu
Journal:  J Infect       Date:  2020-03-19       Impact factor: 6.072

10.  Tracking Changes in SARS-CoV-2 Spike: Evidence that D614G Increases Infectivity of the COVID-19 Virus.

Authors:  Bette Korber; Will M Fischer; Sandrasegaram Gnanakaran; Hyejin Yoon; James Theiler; Werner Abfalterer; Nick Hengartner; Elena E Giorgi; Tanmoy Bhattacharya; Brian Foley; Kathryn M Hastie; Matthew D Parker; David G Partridge; Cariad M Evans; Timothy M Freeman; Thushan I de Silva; Charlene McDanal; Lautaro G Perez; Haili Tang; Alex Moon-Walker; Sean P Whelan; Celia C LaBranche; Erica O Saphire; David C Montefiori
Journal:  Cell       Date:  2020-07-03       Impact factor: 66.850

View more
  1 in total

1.  The Mutational Landscape of SARS-CoV-2 Variants of Concern Recovered From Egyptian Patients in 2021.

Authors:  Mohamed G Seadawy; Reem Binsuwaidan; Badriyah Alotaibi; Thanaa A El-Masry; Bassem E El-Harty; Ahmed F Gad; Walid F Elkhatib; Maisra M El-Bouseary
Journal:  Front Microbiol       Date:  2022-07-07       Impact factor: 6.064

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.