| Literature DB >> 30240114 |
Laura M Bergner1,2, Richard J Orton1,2, Ana da Silva Filipe2, Andrew E Shaw2, Daniel J Becker3,4,5, Carlos Tello6,7, Roman Biek1, Daniel G Streicker1,2.
Abstract
Microbial communities play an important role in organismal and ecosystem health. While high-throughput metabarcoding has revolutionized the study of bacterial communities, generating comparable viral communities has proven elusive, particularly in wildlife samples where the diversity of viruses and limited quantities of viral nucleic acid present distinctive challenges. Metagenomic sequencing is a promising solution for studying viral communities, but the lack of standardized methods currently precludes comparisons across host taxa or localities. Here, we developed an untargeted shotgun metagenomic sequencing protocol to generate comparable viral communities from noninvasively collected faecal and oropharyngeal swabs. Using samples from common vampire bats (Desmodus rotundus), a key species for virus transmission to humans and domestic animals, we tested how different storage media, nucleic acid extraction procedures and enrichment steps affect viral community detection. Based on finding viral contamination in foetal bovine serum, we recommend storing swabs in RNAlater or another nonbiological medium. We recommend extracting nucleic acid directly from swabs rather than from supernatant or pelleted material, which had undetectable levels of viral RNA. Results from a low-input RNA library preparation protocol suggest that ribosomal RNA depletion and light DNase treatment reduce host and bacterial nucleic acid, and improve virus detection. Finally, applying our approach to twelve pooled samples from seven localities in Peru, we showed that detected viral communities saturated at the attained sequencing depth, allowing unbiased comparisons of viral community composition. Future studies using the methods outlined here will elucidate the determinants of viral communities across host species, environments and time. 2018 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.Entities:
Keywords: zzm321990Desmodus rotunduszzm321990; microbial community; shotgun metagenomics; virome
Mesh:
Year: 2018 PMID: 30240114 PMCID: PMC6378809 DOI: 10.1111/1755-0998.12946
Source DB: PubMed Journal: Mol Ecol Resour ISSN: 1755-098X Impact factor: 7.090
Figure 1Sampling of vampire bat colonies used for enrichment and subsampling tests. Individual colonies are represented as white points and midpoints for each pool, in which one to two colonies were combined, are represented as circles (faeces) or triangles (saliva). Colony names are shown in the same colour as the pools in which they are included. Peru country borders and departments within Peru where samples were collected are outlined in white. The inset map shows South America, with Peru highlighted in the grey box [Colour figure can be viewed at wileyonlinelibrary.com]
Multi‐colony pools sequenced for enrichment tests and subsampling. Pools were created by combining nucleic acid from 10 individual swabs of the same sample type from the same site or locality
| Pool ID | Sample type | Raw reads | Viral reads | Colony 1 | Colony 2 | Test | Treatment | Subsampled |
|---|---|---|---|---|---|---|---|---|
|
| Faeces | 12,166,001 | 10,870 | AYA7 | AYA14 | – | – | Y |
|
| Saliva | 9,507,979 | 431 | AYA7 | AYA14 | – | – | Y |
|
| Faeces | 12,000,988 | 2,417 | API1 | AYA11 | – | – | Y |
|
| Saliva | 15,121,355 | 609 | API1 | AYA11 | – | – | Y |
| AMA_L_ F_NR | Faeces | 17,827,799 | 2,062 | AMA2 | AMA6 | rRNA | Non‐enriched | N |
|
| Faeces | 17,760,709 | 28,344 | AMA2 | AMA6 | rRNA | Enriched | N |
|
| Saliva | 9,363,273 | 305 | AMA2 | AMA4 | – | – | Y |
| CAJ_L_F_NR | Faeces | 15,940,753 | 1,179 | CAJ4 | – | rRNA | Non‐enriched | N |
|
| Faeces | 15,843,806 | 5,945 | CAJ4 | – | rRNA | Enriched | N |
|
| Saliva | 8,685,456 | 600 | CAJ4 | – | – | – | Y |
|
| Faeces | 8,661,617 | 8,085 | CAJ1 | CAJ2 | DNase | Light | Y |
| CAJ_H_F_2 | Faeces | 9,272,152 | 8,187 | CAJ1 | CAJ2 | DNase | Harsh | Y |
|
| Saliva | 11,830,542 | 534 | CAJ1 | CAJ2 | – | – | Y |
|
| Faeces | 10,814,816 | 11,285 | HUA1 | HUA2 | – | – | Y |
|
| Saliva | 8,931,393 | 517 | HUA1 | HUA2 | – | – | Y |
| LMA_L_F_NR | Faeces | 19,605,605 | 1,425 | LMA5 | LMA6 | rRNA | Non‐enriched | N |
|
| Faeces | 17,365,381 | 8,206 | LMA5 | LMA6 | rRNA | Enriched | N |
| LMA_L_SV_NR | Saliva | 18,698,730 | 75 | LMA5 | LMA6 | rRNA | Non‐enriched | N |
|
| Saliva | 15,953,442 | 483 | LMA5 | LMA6 | rRNA | Enriched | N |
| LR_L_F_NR | Faeces | 19,531,234 | 1,535 | LR1 | LR2 | rRNA | Non‐enriched | N |
|
| Faeces | 13,843,629 | 4,544 | LR1 | LR2 | rRNA | Enriched | N |
|
| Saliva | 9,023,821 | 478 | LR1 | LR2 | – | – | Y |
All pool IDs reflect the locality (AAC, Ayacucho‐Apurímac‐Cusco; AMA, Amazonas; CAJ, Cajamarca; HUA, Huánuco; LMA, Lima; LR, Loreto) and sample type (F, faeces; SV, saliva). Some IDs also reflect elevation (H, high; L, low) to differentiate localities with multiple pools. NR and R correspond to ribosomal treatment, either non‐enriched or enriched, and one sample (CAJ_H_F) has associated numbers (1 and 2) referring to two batches that received different treatments during viral enrichment. Pools processed using the final protocol are shown in bold.
Colony codes correspond to department within Peru. Colony locations and pool midpoints are shown in Figure 1.
Enrichment tests are abbreviated as rRNA (ribosomal RNA depletion) and DNase (light or harsh DNase treatment).
Viral families detected from shotgun metagenomic sequencing of FBS. For each viral family, the number of reads and contigs is reported for each of the two batches of FBS that were analysed
| Family | FBS1 | FBS2 | ||
|---|---|---|---|---|
| Reads | Contigs | Reads | Contigs | |
|
| 27 | 0 | 40 | 2 |
|
| 2 | 0 | 0 | 0 |
|
| 52 | 2 | 10 | 0 |
|
| 29 | 0 | 47 | 5 |
|
| 73 | 4 | 32 | 2 |
|
| 2 | 2 | 6 | 1 |
|
| 1 | 0 | 0 | 0 |
|
| 4 | 0 | 0 | 0 |
|
| 9 | 0 | 0 | 0 |
|
| 180 | 20 | 104 | 15 |
|
| 2 | 0 | 2 | 0 |
|
| 0 | 0 | 1 | 0 |
|
| 950 | 15 | 267 | 11 |
|
| 8 | 0 | 0 | 0 |
| Total viral reads | 1,373 | 516 | ||
| Raw reads | 13,565,793 | 7,935,389 | ||
FBS1 and FBS2 were two different batches of FBS that were sequenced.
Number of reads assigned to families do not add up to the total number of viral reads as some were classified as viral but not assigned to a family.
Summary of mock swabs tested for different extraction methods using qPCR. Swabs were inoculated with Schmallenberg virus and final virus concentration following extraction was measured using qPCR for different swab types and initial quantities of virus
| Swab type | Virus concentration (copies/ml) | Initial swab quantity (copies) | Extraction Replicate | Average | Average qPCR copies ( |
|---|---|---|---|---|---|
| Wooden‐base | 104 | 220 | 1 | 37.44 (0.45) | 0.67 (0.20) |
| 2 | No | No | |||
| 3 | 36.69 (0.72) | 1.16 (0.55) | |||
| Wooden‐base | 105 | 2,200 | 1 | 33.86 (0.36) | 7.84 (1.94) |
| 2 | 33.74 (0.6) | 8.83 (3.79) | |||
| 3 | 36.98 (0.57) | 0.94 (0.37) | |||
| Aluminium‐base | 105 | 2,200 | 1 | 34 (0.12) | 7 (0.59) |
| Wooden‐base | 106 | 22,000 | 1 | 33.49 (0.35) | 10.13 (2.4) |
| 2 | 31.72 (0.13) | 33.70 (2.86) | |||
| 3 | 32.92 (0.32) | 14.90 (3.06) |
SD: standard deviation.
Indicates only two of the three qPCR replicates were measurable (one replicate was below the limit of detection). When all three qPCR replicates were below the limit of detection, this is indicated with no C t. All other average Ct and average qPCR quantities are calculated based on three qPCR replicates.
Virus concentration and initial swab quantities are calculated based on qPCR measurements of undiluted virus, which was then diluted to obtain the concentrations used in this experiment.
Figure 2Comparison of viral reads and viral families in ribosomal depletion enrichment treatments. (a) Comparisons are shown for number of viral reads as log [N +1]) in enriched (N = 5) and non‐enriched (N = 5). (b) Total viral families and vertebrate‐infecting viral families detected in samples enriched by rRNA depletion (N = 5) compared to non‐enriched samples (N = 5)
Figure 3Comparison of reads per vertebrate‐infecting viral family across samples. Comparisons are shown for reads per vertebrate‐infecting viral family summed across samples enriched by ribosomal depletion (N = 5) and non‐enriched samples (N = 5). Read number comparison is shown for summed reads (as opposed to the mean) to enable visualization on a log scale
Figure 4The number of total viral families and vertebrate‐infecting viral families detected after light and harsh DNase treatments. Comparisons show a single split sample, with half receiving light DNase treatment and half harsh DNase treatment
Figure 5Viral reads increase proportionally to the percentage of raw reads analysed. The number of reads assigned as viral for faecal (N = 5) and saliva (N = 7) samples is shown at increasing percentages of total raw reads. Five replicates of each sample are depicted using the same symbol and colour; colours correspond to localities shown in Figure 1 [Colour figure can be viewed at wileyonlinelibrary.com]
Figure 6Viral communities saturate at high read depths. Panels show the number of viral families and vertebrate‐infecting viral families detected in faecal (N = 5) and saliva (N = 7) samples at increasing percentages of the total raw reads. Percentage reads are z‐score standardized by subtracting the mean and dividing by the standard deviation. Points, which are semi‐transparent to indicate density, show the rescaled original data and lines show the model prediction
Model comparison for viral family detection in subsampling analyses. Linear and polynomial models were compared for each sample type (faeces and saliva) and filtering (all viral families and vertebrate‐infecting only) combination at the family level. For each combination, two models were run and compared through both likelihood ratio test (L, χ2, df and p‐value) and AIC (AIC and ΔAIC)
| Model |
| χ2 |
|
| AIC | ΔAIC | |
|---|---|---|---|---|---|---|---|
| Faecal viral families | Linear | −556.1 | 17.271 | 1 | 3.24E−05 | 1,118.2 | 15.271 |
| Polynomial | −547.47 | 1,102.9 | |||||
| Saliva viral families | Linear | −772.02 | 18.304 | 1 | 1.88E−05 | 1550 | 16.304 |
| Polynomial | −762.87 | 1533.7 | |||||
| Vertebrate‐infecting faecal viral families | Linear | −407.39 | 10.356 | 1 | 0.00129 | 820.79 | 8.3564 |
| Polynomial | −402.22 | 812.43 | |||||
| Vertebrate‐infecting saliva viral families | Linear | −573.15 | 0.8262 | 1 | 0.3634 | 1,152.3 | 1.174 |
| Polynomial | −572.73 | 1,153.5 |