| Literature DB >> 35369526 |
Qiang Huang1, Qiang Zhang2, Paul W Bible3, Qiaoxing Liang4, Fangfang Zheng5, Ying Wang1, Yuantao Hao1, Yu Liu1.
Abstract
Early detection of SARS-CoV-2 variants enables timely tracking of clinically important strains in order to inform the public health response. Current subtype-based variant surveillance depending on prior subtype assignment according to lag features and their continuous risk assessment may delay this process. We proposed a weighted network framework to model the frequency trajectories of mutations (FTMs) for SARS-CoV-2 variant tracing, without requiring prior subtype assignment. This framework modularizes the FTMs and conglomerates synchronous FTMs together to represent the variants. It also generates module clusters to unveil the epidemic stages and their contemporaneous variants. Eventually, the module-based variants are assessed by phylogenetic tree through sub-sampling to facilitate communication and control of the epidemic. This process was benchmarked using worldwide GISAID data, which not only demonstrated all the methodology features but also showed the module-based variant identification had highly specific and sensitive mapping with the global phylogenetic tree. When applying this process to regional data like India and South Africa for SARS-CoV-2 variant surveillance, the approach clearly elucidated the national dispersal history of the viral variants and their co-circulation pattern, and provided much earlier warning of Beta (B.1.351), Delta (B.1.617.2), and Omicron (B.1.1.529). In summary, our work showed that the weighted network modeling of FTMs enables us to rapidly and easily track down SARS-CoV-2 variants overcoming prior viral subtyping with lag features, accelerating the understanding and surveillance of COVID-19.Entities:
Keywords: SARS-CoV-2; frequency trajectories; mutations; variant tracing; weighted network analysis
Year: 2022 PMID: 35369526 PMCID: PMC8966897 DOI: 10.3389/fmicb.2022.859241
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1Workflows for SARS-CoV-2 variant surveillance. (A) Workflow comparison between subtype-based and FTM-based variant surveillance methods. (B) Outline of a weighted network framework for variant surveillance using FTMs.
FIGURE 2Synchronous temporal changes between variant-specific FTMs and variant prevalence. (A) Weekly distribution of SARS-CoV-2 genome sequences according to sampling time. (B) Time course of major variant distribution in collected sequences. (C) A Wald’s linkage hierarchical cluster tree of frequency trajectories of mutations. One hundred and fifty-eight mutations passing filtration were analyzed, annotated and displayed. Variant-specific mutations were flagged.
FIGURE 3A benchmark to use weighted network framework for identification of worldwide pandemic variants. (A) Clustering dendrogram of 158 FTMs from GISAID worldwide data. The module numbers are labeled and module clusters are highlighted with different colors. (B) The heatmap of module-based variant prevalence. The variants were determined by core mutations within each module. The modules were reordered and colored according to their module clusters and time course. (C) Network graph with topology overlap values > 10– 3 to show the relationship between nodes and modules of the weighted network. (D) Network graph with topology overlap values > 0.1. (E) Phylogenic evaluation of detected worldwide pandemic variants. Time-resolved maximum clade credibility phylogeny is shown and identified variants are highlighted and annotated with visually friendly colors.
FIGURE 4A demonstration to identify SARS-CoV-2 variants prevalent in India using the weighted network. (A) Weekly distribution of SARS-CoV-2 genome sequences according to sampling time (top) and the heatmap of module-based variant prevalence (bottom). Core mutations within each module were used to define the variants. The modules were reordered and colored according to their module clusters and time course. The weeks when Delta (B.1.617.2) was identified as a prevalent variant by network model (green) or reported as a VUI by WHO (blue) are highlighted by rectangles. (B) Phylogenic evaluation of detected endemic SARS-CoV-2 variants in South Africa. The detected variants are highlighted and annotated with visually friendly colors.
FIGURE 5SARS-CoV-2 variant surveillance in South Africa using the weighted network. (A) Weekly distribution of SARS-CoV-2 genome sequences according to sampling time (top) and the heatmap of module-based variant prevalence (bottom). Core mutations within each module were used to define the variants. The modules were reordered and colored according to their module clusters and time course. The weeks when Omicron (B.1.1.529) and Beta (B.1.351) were identified as prevalent variants by network model (green) or reported as VUI or VOC by WHO (blue) are highlighted by rectangles. (B–E) Phylogenic evaluation of every major SARS-CoV-2 variant detected in South Africa, including Omicron (B), Delta (C), Beta (D) and C.1 (E). The module-based variants having consistent classification with Pangolin lineages are labeled.