| Literature DB >> 35663867 |
Jessica E Agius1,2,3, Jessica C Johnson-Mackinnon1,2,3, Winkie Fong1,2, Mailie Gall3,4, Connie Lam1, Kerri Basile4, Jen Kok1,4, Alicia Arnott1,3,4, Vitali Sintchenko1,2,3,4, Rebecca J Rockett1,2,3.
Abstract
Background: Low frequency intrahost single nucleotide variants (iSNVs) of Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) have been increasingly recognised as predictive indicators of positive selection. Particularly as growing numbers of SARS-CoV-2 variants of interest (VOI) and concern (VOC) emerge. However, the dynamics of subgenomic RNA (sgRNA) expression and its impact on genomic diversity and infection outcome remain poorly understood. This study aims to investigate and quantify iSNVs and sgRNA expression in single and longitudinally sampled cohorts over the course of mild and severe SARS-CoV-2 infection, benchmarked against an in vitro infection model.Entities:
Keywords: COVID-19; SARS-CoV-2; dynamics; evolution; iSNV; sub-genomic RNA; variants; within-host diversity
Year: 2022 PMID: 35663867 PMCID: PMC9161297 DOI: 10.3389/fmicb.2022.824217
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 6.064
FIGURE 1Genomic and epidemiological diversity of clinical and culture SARS-CoV-2 sequences. The SARS-CoV-2 phylogeny was constructed in a local version of Nextstrain (auspice.us) and shows the evolutionary relationship of SARS-CoV-2 genomes sequenced within this study (black dots) and a subsample of publicly available SARS CoV-2 genomes collected globally (n = 1,079). The tree branch lengths represent divergence, with sequences rooted relative to early samples collected in Wuhan, China (Wuhan-Hu-1/2019).
Demographics of SARS-CoV-2 positive cases within the severe and mild cohorts.
| Cohort | Total cases | Total samples | Lineages ( | Gender M:F | Age median range |
| Severe disease | 16 | 26 | B.1 (7), B.6 (4), B.1.617.2 (13), A.2.2 (1), A (1) | 8M:8F | Median = 65; Range 27–94 |
| Mild | 23 | 25 | D.2 (11), B.1 (6), B.1.617.2 (7), B.1.1 (1). | 6M:15F | Median = 56; Range 12–71 |
| Total | 39 | 51 | A (1), A.2.2 (1), B.1 (13), B.1.1 (1), B.1.617.2 (20), B.6 (4), D.2 (11) | 14M:23F | Median = 58; Range 12–94 |
M, male; F, female.
FIGURE 2iSNV and SNP frequencies and the corresponding synonymous and non-synonymous mutations across the SAR-CoV-2 genomes for the (A) severe and (B) mild cohorts, and (C) culture dilutions. A frequency of ≥ 0.9 was considered a SNP (red), frequencies below 0.05 were not included in the analysis. Identified problematic sites are outlined in black and denoted with an * on the x-axis. Amino acid changes for non-synonymous mutations in the spike region are in bold. The spike gene is highlighted in pink. SARS-CoV-2 Delta lineages are highlighted in grey. Horizontal black bars group samples from the same patient. Descriptors A, B, and D in panel (C) refer to Lineage A, B.1.351, and B.1.617.2, respectively. Bracketed numbers indicate the date of sample collection post-symptom onset.
FIGURE 3Frequencies of SNPs (A) and iSNVs (B) by SARS-CoV-2 gene for severe (red) and mild (green) cohorts. Frequencies ≥ 0.9 were considered SNPs. Problematic sites are not included. NC signifies non-coding region of the genome. Statistical significance (p ≤ 0.05) is denoted by (*). The frequency of iSNVs in the spike gene of SARS-CoV-2 were significantly different between the severe and mild cohorts.
FIGURE 4Counts of SNPs (A) and iSNVs (B) by SARS-CoV-2 gene for severe (red) and mild (green) cohorts. Frequencies ≥ 0.9 were considered SNPs. Problematic sites are not included. Statistical significance (p ≤ 0.05) is denoted by (*) and NC signifies non-coding region of the genome. The number of iSNVs in the ORF1ab gene of SARS-CoV-2 were significantly different between the severe and mild cohorts.
FIGURE 5Distribution of sgRPTL counts across all genes. sgRNA was detected in in the mild (red) and severe (green) clinical cohorts. Statistical significance (p ≤ 0.05) is denoted by (*). S sgRNA detected in n = 24/26 severe and n = 21/25 mild genomes, ORF3a in 24/26 severe genomes and 17/25 mild genomes, E in 1/26 severe genomes and 6/25 mild genomes, M in 20/26 severe genomes and 15/25 mild genomes, ORF6 in 1/26 severe genomes and 11/25 mild genomes, ORF7a in 24/26 severe genomes and 18/25 mild genomes, ORF8 in 22/26 severe genomes and 20/25 mild genomes and N in 25/26 severe genomes and 24/25 mild genomes.