| Literature DB >> 31446609 |
Sofia Strubbia1, My V T Phan2, Julien Schaeffer1, Marion Koopmans2, Matthew Cotten2,3,4,5, Françoise S Le Guyader6.
Abstract
This study aimed to optimize a method to identify human enteric viruses in sewage and stool samples using random primed next-generation sequencing. We tested three methods, two employed virus enrichment based on the binding properties of the viral capsid using pig-mucin capture or by selecting viral RNA prior to library preparation through a capture using the SureSelect target enrichment. The third method was based on a non-specific biophysical precipitation with polyethylene glycol. Full genomes of a number of common human enteric viruses including norovirus, rotavirus, husavirus, enterovirus and astrovirus were obtained. In stool samples full norovirus genome were detected as well as partial enterovirus genome. A variety of norovirus sequences was detected in sewage samples, with genogroup II being more prevalent. Interestingly, the pig-mucin capture enhanced not only the recovery of norovirus and rotavirus but also recovery of astrovirus, sapovirus and husavirus. Documenting sewage virome using these methods provides information for molecular epidemiology and may be useful in developing strategies to prevent further spread of viruses.Entities:
Keywords: Human enteric viruses; Metagenomic; Norovirus; Sewage; Virome
Mesh:
Substances:
Year: 2019 PMID: 31446609 PMCID: PMC6848244 DOI: 10.1007/s12560-019-09402-3
Source DB: PubMed Journal: Food Environ Virol ISSN: 1867-0334 Impact factor: 2.778
Fig. 1Schematic of sample processing and sequencing used in this study. The three main frames on the top contain the method names (A, B and C), the number and the type of sample treated with each protocol. Key steps common to all protocols are represented in gray rectangle
Fig. 2Heat map of larger viral contig yield. Quality-controlled short-read data were de novo assembled into large contigs and contigs were classified at the protein level by comparison with virus family-specific reference sets (see Methods). The numbers of contigs in each sample with > 60% protein identity and 500 nt minimum contig length were plotted in a heat map. The colorbar to the right indicates number of contigs detected per sample in each virus family
Norovirus concentrations in wastewater samples used for this study and corresponding calicivirus reads
| Sample | Method | Norovirus* | Calicivirus | |
|---|---|---|---|---|
| GI | GII | reads | ||
| E1777 | A | 3 | 497 | NA |
| E1797 | A | 64 | 39 | NA |
| E1854 | A | 3 | 37 | NA |
| E1887 | B | 7 | 219 | 2507 |
| C | 2 | 223 | 13,548 | |
| E1919 | B | 19 | 361 | 3242 |
| C | 6 | 346 | 10,635 | |
| E1920 | B | 20 | 466 | 10,406 |
| C | 3 | 3962 | ||
| E1927 | B | 26 | 140 | 1750 |
| C | 11 | 637 | 2777 | |
NA data not collected
*Concentrations obtained using digital RT-PCR are expressed as RNA copies/µL for genogroup I (GI) and II (GII)
**Number of reads mapped on contigs > 500nt using Bowtie2
Norovirus strains identified in sewage samples using method A
| Sample | Contig length | Genotype* | Coverage** |
|---|---|---|---|
| E1777 | 7278 | GIIP21-GII.3 | 19.586 |
| 7484 | GIIP7-GII.6 | 99.576 | |
| 6828 | GIIPe-GII.4syd-v2 | 23.144 | |
| 7497 | GIIPe-GII4syd | 21.732 | |
| 7648 | GIP3-GI.3 | 16.817 | |
| E1797 | 7343 | GIIP17-GII.7 | 85.503 |
| 7431 | GIIP7-GII.6 | 47.253 | |
| 3467 | GIP9-GI.9 | 12.576 | |
| 7679 | GIPb-GI.6 | 71.396 | |
| E1854 | 1582 | GIIP16-GII.13 | 12.72 |
| 6509 | GIIP17-GII.17 | 8.225 | |
| 1221 | GIIPg | 8.663 | |
| 7641 | GIPb-GI.6 | 15.532 | |
| 3687 | GIV | 12.327 |
*Sequences were identified using the online norovirus genotyping tool v2.0 (Kroneman et al. 2011)
**Coverage was calculated using Bowtie2 (Langmead et al. 2012)
Norovirus and other human virus detected in stool samples using method B
| Sample | Virus | Contig length | Genotype |
|---|---|---|---|
| S570 | Norovirus | 7609 | GIIPe-GII.4 syd |
| S287 | Norovirus | 7542 | GIIP21-GII.3 |
| Norovirus | 1104 | GIIP17-GII.17 | |
| Enterovirus | 653 | coxsackievirus A9 | |
| S5 | Norovirus | 7390 | GIIP4NewOrleans-GII.4syd |
| Norovirus | 3399 | GII.P21-GII.3 | |
| Enterovirus | 1387 | coxsackievirus A9 | |
| S581 | Norovirus | 7528 | GIIP17-GII.17 |
| Enterovirus | 1421 | coxsackievirus A9 | |
| S582 | Norovirus | 7536 | GIIP17-GII.17 |
| Enterovirus | 1363 | coxsackievirus A9 |
Human enteric viruses detected using method B and C
| Sample | Method B | Method C | ||||
|---|---|---|---|---|---|---|
| Virus | Contig | Genotype | Virus | Contig | Genotype* | |
| S570 | Norovirus | 7609 | GIIPe-GII.4 | Norovirus | 6976 | GIIPe-GII.4 |
| E1887 | Sapovirus | 7414 | GI.2 | Sapovirus | 7491 | GI.2 |
| Norovirus | 3241 | GIIP22-GII.17 | Norovirus | 2366 | GII.P22 | |
| Norovirus | 2961 | GII | Norovirus | 2129 | GII | |
| Norovirus | 2996 | GII.P7-GII.17 | Norovirus | 3126 | GII.P17-GII.17 | |
| Norovirus | 3024 | GII.P17-GII.6 | ||||
| Rotavirus | 836 | Rotavirus | 3283 | G9[P8] | ||
| Astrovirus | 3860 | Type 2 | Astrovirus | 6519 | Type 2 | |
| Husavirus | 494 | Sp.16915_89 | ||||
| E1919 | Norovirus | 4598 | GII.P16-GII.2 | Norovirus | 1028 | GIIP16 |
| Norovirus | 2692 | GII | Norovirus | 3004 | GII.P16-GII.4 syd | |
| Sapovirus | 1209 | GII.3 | ||||
| Astrovirus | 6810 | Type 1 | Astrovirus | 6412 | Type 1 | |
| E1920 | Norovirus | 2239 | GII | Norovirus | 510 | GII.2 |
| Norovirus | 332 | GI–GI.3 | ||||
| Astrovirus | 6782 | Type 1 | Astrovirus | 1878 | Type 1 | |
| E1927 | Norovirus | 944 | GII.2 | Norovirus | 631 | GII.Postdam |
| Norovirus | 616 | GII | ||||
| Norovirus | 442 | GI | Norovirus | 631 | GI–GI.3 | |
| Sapovirus | 571 | GI.1 Seoul | ||||
| Astrovirus | 1878 | Type 1 | Astrovirus | 6588 | Type 5 | |
| Astrovirus | 3948 | Type 1 | ||||
| Aichi virus | 2692 | Type 1 | ||||
*Strain identification was assigned using the online Norovirus genotyping tool v2.0 (Kroneman et al. 2011)
Fig. 3Scatter plot of reads identified for each of 8 virus families (percentage of total reads). Quality-controlled reads for each of the 18 sample/conditions were mapped to virus family reference sets (see Methods) for Astroviridae, Caliciviridae, Nodaviridae, Leviviridae, Microviridae, Picornaviridae, Picobirnaviridae and Reoviridae. The number of reads identified in each family was plotted as percentage of the total quality-controlled reads for that read set. Data sets were separated into sample types (stool or sewage) and colored according sample treatment (method A: pink, method B: orange and method C: blue)