| Literature DB >> 32894206 |
Xiaofeng Dong1, Jordana Munoz-Basagoiti1,2, Natasha Y Rickett1,2, Georgios Pollakis1,2, William A Paxton1,2, Stephan Günther3, Romy Kerber3, Lisa F P Ng1,2,4, Michael J Elmore5, N'faly Magassouba6, Miles W Carroll2,5, David A Matthews7, Julian A Hiscox8,9,10.
Abstract
BACKGROUND: Viral load is a major contributor to outcome in patients with Ebola virus disease (EVD), with high values leading to a fatal outcome. Evidence from the 2013-2016 Ebola virus (EBOV) outbreak indicated that different genotypes of the virus can have different phenotypes in patients. Additionally, due to the error-prone nature of viral RNA synthesis in an individual patient, the EBOV genome exists around a dominant viral genome sequence. The minor variants within a patient may contribute to the overall phenotype in terms of viral protein function. To investigate the effects of these minor variants, blood samples from patients with acute EVD were deeply sequenced.Entities:
Keywords: Ebola virus; Ebola virus disease; Evolution; Patient outcome; Virus genetics
Year: 2020 PMID: 32894206 PMCID: PMC7475720 DOI: 10.1186/s13059-020-02148-3
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Ebola genome-wide mutational bias and viral load. a Comparison of the viral load (1/Ct) in hospitalised fatal versus hospitalised survived blood samples taken during the acute phase upon admission to an Ebola virus treatment centre. These data followed the normal distribution, so P values were calculated with a two-sided t test. b Comparison of the nucleotide variation from the dominant genome sequence in an individual patient in hospitalised fatal versus hospitalised survived cases. The variation frequency was calculated by transversion or transition deviations using a 200-nucleotide sliding window, and the P values were calculated with a one-sided Wilcoxon rank sum test as the data did not fit a normal distribution. c Q-Q plots were used to compare the distribution of the average nucleotide deviation in an individual patient in hospitalised fatal versus hospitalised survived cases using a 200-nucleotide sliding window along the genome for hospitalised survived versus hospitalised fatal cases. d Average nucleotide variation along the Ebola genome calculated by substitutions leading to either transversion or transition changes using a 200-nucleotide sliding window. e Comparison of the viral load (1/Ct) in hospitalised fatal versus hospitalised survived cases. A two-sided Spearman rank correlation test was used to estimate the correlation of the average nucleotide variation and viral load (1/Ct) of each sample from a patient, where the R value is the correlation coefficient ranging in − 1 (strong negative correlation) and + 1 (strong positive correlation), and P is the P value for this test
Fig. 2Analysis of non-synonymous changes and their correlation with viral load in acute patients. a The average non-synonymous variation in codon frequency at every amino acid site of each EBOV protein. b Comparison of the non-synonymous nucleotide substitution frequency in the L protein at positions 572, 986 and 2061, and the P values were calculated with a one-sided Wilcoxon rank sum test. c A two-sided Spearman rank correlation test was used to estimate the correlation of average non-synonymous deviation in viral genomes with viral load (1/Ct) at positions 572, 986 and 2061 in patients who were either hospitalised fatal versus hospitalised survived cases, where the R value is the correlation coefficient ranging in − 1 (strong negative correlation) and 1 (strong positive correlation), and P is P value for this test
Fig. 3Comparison between Ts and Tv ratios that resulted in a non-synonymous change in positions 572, 986 and 2061 in the L protein and position 28 in VP24. The P values were calculated with a one-sided Wilcoxon rank sum test
Fig. 4a Comparison of three amino acid variation frequencies in the L protein at positions 572, 986 and 2061. P values were calculated with the one-tailed Wilcoxon rank sum test. b A Spearman rank correlation test was used to estimate the correlation of these three amino acid variation frequencies with viral load (1/Ct) at positions 572, 986 and 2061, where the R value is the correlation coefficient ranging from − 1 (strong negative correlation) to + 1 (strong positive correlation), and P is the P value for this test. In a and b, only the samples with at least amino acid variation are shown
Fig. 5Analysis of the frequency of stop codon substitution in viral proteins and viral load. a Comparison of stop codon frequency in the L protein at position 986. P values were calculated with the one-sided Wilcoxon rank sum test, with the average stop codon frequency at position 986 in the EBOV L protein and compassion between hospitalised fatal and hospitalised survived cases. b A Q-Q plot was used to compare the distribution of the stop codon at position 986 in the L protein between hospitalised fatal and hospitalised survived cases. The values below the line suggest the data, i.e. the presence of the stop codon, was more frequent in the hospitalised survivor cases. c The summary of stop codon frequency in all EBOV proteins compared between hospitalised fatal and hospitalised survived cases. d A two-sided Spearman rank correlation test was used to estimate the correlation of stop codon frequency with viral load (1/Ct) at position 986, where the R value is the correlation coefficient ranging from − 1 (strong negative correlation) to + 1 (strong positive correlation), and P is the P value for this test. In b, d and e, only the samples with at least one stop codon are shown. In a, c and d, only the samples with at least one stop codon are shown
Fig. 6Functionality of LSTOP and L3mut in an EBOV transcription/replication plasmid-based system (mini-genome) in cell culture. a, b Schematic diagrams of the conserved domains (grey boxes), functional motifs (purple) and variable regions or hinges (discontinuous green line) in EBOV L3mut and LSTOP. This diagram is based on data for filovirus and mononegavirus models for the L protein. Conserved blocks I–III constitute a RdRp, which is closely associated with a capping domain (Cap). Block VI has methyltransferase (MTase) activity, and downstream of this is located in a small C-terminal domain (CTD). Red highlighted the amino acid position where the a three most frequently found amino acid changes in the L protein at positions 572, 98s and 2061 are located and b the truncated protein due to the stop codon in L. c EBOV mini-genome system activity at different ratios between EBOV L and LSTOP and d at equal EBOV L but different LSTOP amounts. Results are shown as the mean ± S.D. from one experiment performed in triplicate. ***P < 0.001; **P < 0.01; *P < 0.05. Western blot for luciferase (LUC), EBOV L/LSTOP and house-keeping GAPDH protein abundance in cells transfected with the mini-genome system. e EBOV mini-genome system activity in the presence of the L, no L and L3mut and at different ratios between L and L3mut. Blotting showed LUC and GAPDH abundance in cells transfected with the mini-genome system plasmids. f VP35-eGFP was used in a co-immunoprecipitation (coIP) assay to examine its interaction with EBOV LSTOP. Blotting showed the presence of eGFP and viral proteins VP35/eGFP, VP35, L and NP in the cell lysates (input (I)) and coIP fraction (eluate (E)). g Proposed model of the competition between EBOV L and LSTOP for the viral RdRp co-factor VP35 and the potential reduction in the EBOV RNA synthesis observed in patients with lower viral load (inset panel)