| Literature DB >> 28540488 |
Julia M Maritz1, Krysta H Rogers2, Tara M Rock1, Nicole Liu3, Susan Joseph1, Kirkwood M Land3, Jane M Carlton4.
Abstract
Microbial eukaryotes (protists) are important components of terrestrial and aquatic environments, as well as animal and human microbiomes. Their relationships with metazoa range from mutualistic to parasitic and zoonotic (i.e., transmissible between humans and animals). Despite their ecological importance, our knowledge of protists in urban environments lags behind that of bacteria, largely due to a lack of experimentally validated high-throughput protocols that produce accurate estimates of protist diversity while minimizing non-protist DNA representation. We optimized protocols for detecting zoonotic protists in raw sewage samples, with a focus on trichomonad taxa. First, we investigated the utility of two commonly used variable regions of the 18S rRNA marker gene, V4 and V9, by amplifying and Sanger sequencing 23 different eukaryotic species, including 16 protist species such as Cryptosporidium parvum, Giardia intestinalis, Toxoplasma gondii, and species of trichomonad. Next, we optimized wet-lab methods for sample processing and Illumina sequencing of both regions from raw sewage collected from a private apartment building in New York City. Our results show that both regions are effective at identifying several zoonotic protists that may be present in sewage. A combination of small extractions (1 mL volumes) performed on the same day as sample collection, and the incorporation of a vertebrate blocking primer, is ideal to detect protist taxa of interest and combat the effects of metazoan DNA. We expect that the robust, standardized methods presented in our workflow will be applicable to investigations of protists in other environmental samples, and will help facilitate large-scale investigations of protistan diversity.Entities:
Keywords: 18S rRNA amplicon sequencing; Environmental sequencing; Protist; Sewage; Trichomonad; Zoonoses
Mesh:
Substances:
Year: 2017 PMID: 28540488 PMCID: PMC5653731 DOI: 10.1007/s00248-017-0996-9
Source DB: PubMed Journal: Microb Ecol ISSN: 0095-3628 Impact factor: 4.552
Fig. 1Workflow employed for testing and optimization of protocols to detect microbial eukaryotes in raw sewage. Part 1 was used to evaluate two variable regions of the 18S rRNA gene, V4 and V9, and determine best practices for zoonotic protists. Part 2 was used to optimize the methods in Part 1 and develop experimental methods for Illumina sequencing of raw sewage samples. Methods that were tested but determined suboptimal are indicated in boxes with dotted lines and/or crossed out text
Fig. 2Phylogenetic tree of trichomonad Sanger sequences from a V4 region and b V9 region. Numbers at nodes correspond to Bayesian posterior probabilities. GenBank accession numbers are shown at the left for each taxon. Colored taxa are identical to each other and brackets indicate OTU membership and corresponding ID number from the supplementary material after clustering at 98% identity
DNA yield from July sewage samples with and without storage at −20 °C for 1 week
| Sample ID | Day extracted | Input volume | DNA yield (ng/μL) |
|---|---|---|---|
| S005_1_mL | Day of collection | 1 mL | 2.9 |
| S005_10_mL | Day of collection | 10 mL | 35.0 |
| S005_10_mL_frozen | After 7 days at −20 °C | 10 mL | 5.81 |
| S006_1_mL | Day of collection | 1 mL | 3.56 |
| S006_10_mL | Day of collection | 10 mL | 21.0 |
| S006_10_mL_frozen | After 7 days at −20 °C | 10 mL | 9.8 |
Reads returned after joining and/or de-multiplexing the V4 sequencing data
| Analysis step | Joined | Read 1 only | Read 1 trimmed |
|---|---|---|---|
| Raw sequences or pairs | 18,269,491 | 18,269,491 | 18,269,491 |
| Joined pairs of reads | 53,526 | NA | NA |
| De-multiplexed reads | 42,257 | 6,906,307 | 7,027,716 |
Summary of Illumina sequence data for each 18S region
| Analysis step | V4 region (read 1) | V9 region |
|---|---|---|
| Raw sequences or pairs | 18,269,491 | 29,400,737 |
| Joined pairs of reads | NA | 22,864,953 |
| De-multiplexed reads | 7,027,716 | 21,395,270 |
| Clustered reads (OTUs) | 6,477,688 (2270) | 21,205,088 (8827) |
| Eukaryotic reads (OTUs) | 6,477,688 (2270) | 18,794,581 (7195) |
| Filtered eukaryotic reads (OTUs) | 6,464,532 (824) | 18,664,054 (1444) |
| Average sequences per sample | 808,066 | 1,196,152 |
Fig. 3Alpha diversity analysis for sewage 18S communities using the phylogenetic diversity metric and Shannon Index. All analyses were calculated from replicate OTU tables sampled at a depth of 600,000 sequences. a Alpha diversity of the different extraction volumes for the V4 and V9 regions. Values shown represent the measurements from all 1 and 10 mL (fresh samples only) extraction volumes by region. Error bars represent ± one standard deviation; double asterisks indicates p value <0.001. b Alpha diversity of sewage samples based on extraction volume and blocking primer use for the V9 region. Asterisk indicates p value <0.01
Fig. 4Relative sequence abundances of eukaryotic taxa across experimental conditions for each region. Each bar represents the average of two replicates per condition per time point. All data shown were from fresh extractions. a V4 region results from the 1 and 10 mL extractions. b V9 region results from the 1 and 10 mL extractions (with (B) and without (N) the blocking primer)
Fig. 5LEfSe analysis of 18S communities to determine biomarker taxa across sample groupings for each region. All data shown were from fresh extractions. LDA scores for taxa identified as differentially abundant between a 10 and 1 mL for the V4 region and b 1 mL, 1 mL blocking, 10 mL, and 10 mL blocking samples for the V9 region. Taxa are ranked according to their effect size (LDA score) and associated with the group with the highest median