| Literature DB >> 35064076 |
Johan Ringlander1, Joshua Fingal1, Hanna Kann2, Kasthuri Prakash1, Gustaf Rydell1, Maria Andersson1, Anna Martner1, Magnus Lindh1, Peter Horal1, Kristoffer Hellstrand3, Michael Kann1.
Abstract
Adenosine deaminases acting on RNA (ADAR) are RNA-editing enzymes that may restrict viral infection. We have utilized deep sequencing to determine adenosine to guanine (A→G) mutations, signifying ADAR activity, in clinical samples retrieved from 93 severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2)-infected patients in the early phase of the COVID-19 pandemic. A→G mutations were detected in 0.035% (median) of RNA residues and were predominantly nonsynonymous. These mutations were rarely detected in the major viral population but were abundant in minor viral populations in which A→G was more prevalent than any other mutation (P < 0.001). The A→G substitutions accumulated in the spike protein gene at positions corresponding to amino acids 505 to 510 in the receptor binding motif and at amino acids 650 to 655. The frequency of A→G mutations in minor viral populations was significantly associated with low viral load (P < 0.001). We additionally analyzed A→G mutations in 288,247 SARS-CoV-2 major (consensus) sequences representing the dominant viral population. The A→G mutations observed in minor viral populations in the initial patient cohort were increasingly detected in European consensus sequences between March and June 2020 (P < 0.001) followed by a decline of these mutations in autumn and early winter (P < 0.001). We propose that ADAR-induced deamination of RNA is a significant source of mutated SARS-CoV-2 and hypothesize that the degree of RNA deamination may determine or reflect viral fitness and infectivity.Entities:
Keywords: ADAR; RNA deamination; RNA mutation; SARS-CoV-2
Mesh:
Substances:
Year: 2022 PMID: 35064076 PMCID: PMC8833170 DOI: 10.1073/pnas.2112663119
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205
Demographical and clinical parameters of patients
| Mild | Moderate | ICU | Deceased | Overall | P | Test | |
| ( | ( | ( | ( | ( | |||
| Age, years | |||||||
| Median | 52 | 74.5 | 67 | 79 | 64 | 0.002 | Kruskal–Wallis |
| IQR | 41.8–66.0 | 48.5–80.8 | 54.0–72.0 | 73.25–86.8 | 49.0–78.0 | ||
| Sex | |||||||
| Male (%) | 16 (44.4) | 5 (50.0) | 10 (76.9) | 7 (70.0) | 38 (55.1) | 0.16 | Fisher's exact |
| Female (%) | 20 (55.6) | 5 (50.0) | 3 (23.1) | 3 (30.0) | 31 (44.9) | ||
| Fever, N | |||||||
| Yes (%) | 4 (40.0) | 4 (57.1) | 5 (83.3) | 3 (75.0) | 16 (59.3) | 0.33 | Fisher's exact |
| No (%) | 6 (60.0) | 3 (42.9) | 1 (16.7) | 1 (25.0) | 11 (40.7) | ||
| N/A | 26 | 3 | 7 | 6 | 42 | ||
| CRP, mg/L | |||||||
| Median | 33 | 110 | 160 | 240 | 103 | 0.002 | Kruskal–Wallis |
| IQR | 5.5–63.0 | 43.0–180.0 | 78.0–260.0 | 215.0–260.0 | 34.5–212.5 | ||
| N/A | 25 | 1 | 4 | 3 | 33 | ||
| Days from symptom onset to sampling | |||||||
| Median | 4 | 6.5 | 6 | 3 | 5 | 0.45 | Kruskal–Wallis |
| IQR | 3.3–6.5 | 2.3–10.0 | 5.0–9.3 | 3.0–5.0 | 3.0–8.5 | ||
| N/A | 22 | 4 | 7 | 5 | 38 | ||
| Comorbidity, N | |||||||
| Yes (%) | 8 (22.2) | 4 (40.0) | 6 (46.2) | 3 (30.0) | 21 (30.4) | 0.38 | Fisher's exact |
| No (%) | 28 (77.8) | 6 (60.0) | 7 (53.8) | 7 (70.0) | 48 (69.6) | ||
| Viral load (log10 genome copies/swab) | |||||||
| Median | 7.2 | 6.0 | 6.4 | 7.8 | 6.9 | 0.12 | Kruskal–Wallis |
| IQR | 5.3–8.6 | 5.6–8.5 | 5.2–6.8 | 6.9–8.8 | 5.5–8.5 | ||
*ICU: admitted to intensive care unit.
†P: probability value of differences between groups. The applied tests are shown on the right.
‡IQR: Interquartile range.
§Not stated in patient medical records.
¶CRP value in patient plasma samples, indicating level of systemic inflammation.
Fig. 1.A→G mutations in minor viral populations of SARS-CoV-2. (A) Frequency of specific mutations in the SARS-CoV-2 genome. Results show the frequency of mutation, grouped by A, C, G, and U mutations, out of all mutations detected by Ion Torrent sequencing of the SARS-CoV-2 genome. The analysis was based on 98,325 changed residues that were detected from 5.04 × 109 reads in 69 patient samples. Twenty-five percent of the changed residues were A→G mutations that constituted 0.035% of all reads. The difference between the proportion of A→G mutation and any other mutation was significant (P < 0.001, χ2 test). (B) Correlation between A→G mutations in minor viral populations of SARS-CoV-2 and viral load. Results show the viral load (log10 genome copies per swab) vs. A→G mutations in percent of all reads. Dots represent samples retrieved at first visit. Data were obtained by Ion Torrent sequencing in samples that passed sequencing quality thresholds and a viral load exceeding 4.5 log10 viral genome copies per mL. Viral load was analyzed by real-time PCR. (C) Distribution of A→G mutations in the spike region. Results show the A→G mutation frequency of minor viral populations within the sequenced part of the spike-encoding region of the SARS-CoV-2 genome. The x axis shows nucleotide positions and the corresponding amino acid residues of spike. The y axis shows the percentage of A→G mutations of adenosines merged mean mutational frequency over 20 nt. (D) Spike protein structure. The figure is a 3D visualization of the trimeric spike protein (21). Orange dots (amino acids G505C, G506R/S, Y508C, R509G, and Y523A) represent nucleotide A→G mutations associated with viral load in patient samples and amino acid changes in the RBM that were predicted to cause structural changes. Gray dots indicate A→G mutations with ensuing amino acid changes in the RBD that were not predicted to impact on the structure of RBD.
Association between A→G mutations on SARS-CoV-2 viral load
| Viral load | N | Ns | A→G mutation frequency median (%) | P |
| At first sampling | ||||
| Less than median | 34 | 0.031 (0.022–0.038) | <0.001 | |
| Greater than or equal to median | 35 | 0.020 (0.016–0.024) | ||
| At follow-up | ||||
| Less than median | 13 | 0.036 (0.026–0.047) | 0.044 | |
| Greater than or equal to median | 6 | 0.022 (0.019–0.027) |
*Number of patients and initial samples.
†Number of follow-up samples.
‡P value of Mann–Whitney U test regarding distribution of A→G mutation vs. viral load of samples.
§Viral load values at follow-up were stratified by the median viral load value of the baseline cohort.
Fig. 2.Prevalence of A→G mutations in SARS-CoV-2 genome residues vs. COVID-19 mortality in Europe. Consensus sequences of SARS-CoV-2 genomes from 186,616 European patients, sampled between March and December 2020, were retrieved from the GISAID database. The left y axis shows the percent of circulating genomes, and the right y axis shows the number of deaths. The blue line shows the frequency of SARS-CoV-2 genomes sampled at indicated time points (pooled weekly) that harbored at least one out of the 84 A→G mutated sites more commonly found in samples with low viral load in the previous deep sequence patient samples (). The black line shows the same results by pooled 4-wk intervals. The magenta line shows A→C mutation-bearing SARS-CoV-2 genomes (weekly), and the green line shows A→U mutation-bearing genomes (weekly). The reduction of the frequency of A→G mutations between August and December was significant (P < 0.001, χ2 test). The red line shows the number of deaths caused by COVID-19 in Europe (European Centre for Disease Prevention and Control data collection https://www.ecdc.europa.eu/en/covid-19/data-collection) from March 1 to December 14, 2020), which was significantly associated with the frequency of A→G mutations (P = 0.004).