| Literature DB >> 27094861 |
José M Cuevas1, Marine Combe1, Manoli Torres-Puente2, Raquel Garijo1, Susana Guix3, Javier Buesa4, Jesús Rodríguez-Díaz4, Rafael Sanjuán5.
Abstract
Human noroviruses (NoVs) are a major cause of gastroenteritis worldwide. It is thought that, similar to other RNA viruses, high mutation rates allow NoVs to evolve fast and to undergo rapid immune escape at the population level. However, the rate and spectrum of spontaneous mutations of human NoVs have not been quantified previously. Here, we analyzed the intra-patient diversity of the NoV capsid by carrying out RT-PCR and ultra-deep sequencing with 100,000-fold coverage of 16 stool samples from symptomatic patients. This revealed the presence of low-frequency sequences carrying large numbers of U-to-C or A-to-G base transitions, suggesting a role for hyper-mutation in NoV diversity. To more directly test for hyper-mutation, we performed transfection assays in which the production of mutations was restricted to a single cell infection cycle. This confirmed the presence of sequences with multiple U-to-C/A-to-G transitions, and suggested that hyper-mutation contributed a large fraction of the total NoV spontaneous mutation rate. The type of changes produced and their sequence context are compatible with ADAR-mediated editing of the viral RNA.Entities:
Keywords: Hyper-mutation; Next-generation sequencing; Norovirus; RNA virus
Mesh:
Substances:
Year: 2016 PMID: 27094861 PMCID: PMC7172324 DOI: 10.1016/j.meegid.2016.04.017
Source DB: PubMed Journal: Infect Genet Evol ISSN: 1567-1348 Impact factor: 3.342
Fig. 1NoV genetic map, regions sequenced, and setup of transfection assays. A. In the NoV genetic map, the VP1 capsid gene is shown in red. Molecular clones encompassing the entire VP1 gene were sequenced by the Sanger method. Illumina sequencing was used to analyze smaller regions mapping to the S domain of VP1 and the hyper-variable domain P2 (dark red bars). B. An infectious cDNA clone was transfected in HEK293 cells previously infected with a recombinant vaccinia virus expressing T7 RNA polymerase, allowing for transcription of plus-strand NoV genomic RNA. A primer annealing to minus-strand copies was used for RT-PCR amplification and sequencing. Colored circles represent mutations/variants.
Hyper-mutant (HM) sequences found in stool samples after RT-PCR of two VP1 regions.
| Sample | Region 1 (341 bases) | Region 2 (409 bases) | ||||||
|---|---|---|---|---|---|---|---|---|
| Total reads | HM type | HM reads | Mutations/read (mean, min, max) | Total reads | HM type | HM reads | Mutations/read (mean, min, max) | |
| 3106 | 99,176 | A-to-G | 12 | 7.2 (5–10) | 96,415 | U-to-C | 39 | 5.5 (5–7) |
| 3142 | 97,757 | U-to-C | 27 | 10.6 (5–30) | N/A | – | – | – |
| 3210 | N/A | – | – | – | 96,770 | U-to-C | 13 | 5.6 (5–6) |
| 3213 | 98,967 | A-to-G | 13 | 7.8 (5–12) | 95,936 | A-to-G | 8 | 17.5 (6–37) |
Reads with an average base quality (Sanger-scaled Phred) score Q > 28 at mutated positions.
RT-PCR failed.
Fig. 2Distribution of U-to-C mutations along a VP1 region in sequences derived from transfected HEK293 cells. The alignment on top shows two examples of highly mutated reads from each transfection assay. The heat map below indicates, for each nucleotide site, the total number of deep-sequencing reads carrying a U-to-C mutation (see color legend).
Fig. 3Analysis of hyper-mutation patterns. A. Distribution of the number of U-to-C mutations per deep sequencing read in each of the three transfection assays. The red histograms show the observed counts and the blue line indicates the counts expected from a Poisson model of rare random events. The single parameter of the Poisson distribution was calibrated using the number of reads carrying zero or one mutations. The strong deviation between observed and expected counts shows that sequence reads carrying multiple mutations were more frequent than expected from the Poisson model. Based on this, hyper-mutated reads were defined as those carrying five or more mutations. B. Reproducibility of U-to-C mutation frequency in three transfection assays. In the graphs, each data point corresponds to an U-containing nucleotide site, and the number of times a U-to-C mutation was observed in deep-sequencing reads is plotted for each pair of transfection assays (also represented in the heat map of Fig. 2). From left to right, Spearman correlations were 0.860, 0.855, and 0.955 (p < 10− 12 in all cases). C. Neighbor base preferences for U-to-C and A-to-G hyper-mutation. The histograms show the frequency of U, A, G, and C among 3´neighbors of U-to-C mutations (left), and the frequency of U, A, G, and C among 5´neighbors of A-to-G mutations (right). The crossed lines indicate these same frequencies among non-mutated bases (null expectation). The error bars indicate the SEM frequency from three transfection assays.
Hyper-mutant (HM) VP1 sequences (bases 38-303) derived from transfected HEK293 cells.
| Assay | NGS valid reads | U-to-C HM | A-to-G HM | ||||
|---|---|---|---|---|---|---|---|
| Number of reads | Mutations/read (mean, min, max) | Mutation rate | Number of reads | Mutations/read (mean, min, max) | Mutation rate | ||
| 1 | 99,982 | 266 | 8.7 (5–28) | 8.7 × 10− 5 | 67 | 7.8 (5–14) | 2.0 × 10− 5 |
| 2 | 99,917 | 481 | 8.0 (5–20) | 1.4 × 10− 4 | 0 | NA | 0 |
| 3 | 99,973 | 697 | 9.2 (5–31) | 2.1 × 10− 4 | 16 | 7.1 (5–18) | 4.3 × 10− 6 |
Reads with an average base quality (Sanger-scaled Phred) score Q > 28 at mutated positions.
Mutation rates were estimated by dividing, for a giving assay, the total number of mutations by the product of reads times the length of the PCR product (266 bases).
Mutations found by Sanger sequencing in NoV VP1 molecular clones derived from transfected HEK293 cells.
| Total | Without hyper-mutant | |
|---|---|---|
| VP1 clones sequenced | 64 | 63 |
| Total bases sequenced | 136,032 | 133,901 |
| Total mutations | 21 | 3 |
| U-to-C substitutions | 18 | 0 |
| Total mutation frequency | 1.5 × 10− 4 | 2.2 × 10− 5 |