Literature DB >> 35727055

Full-Length 16S rRNA Gene Sequences from Raw Sewage Samples Spanning Geographic and Seasonal Gradients in Conveyance Systems across the United States.

Emily Lou LaMartina1, Angela L Schmoldt2, Ryan J Newton1.   

Abstract

Wastewater microbiome research often relies on sequencing of hypervariable regions of 16S rRNA genes, which are difficult to classify at refined taxonomic levels. Here, we introduce a data set of near-full-length 16S rRNA genes from samples designed to capture known geographic and seasonal variations in municipal wastewater microbial communities.

Entities:  

Year:  2022        PMID: 35727055      PMCID: PMC9302073          DOI: 10.1128/mra.00319-22

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

Wastewater-based monitoring for disease-causing entities is growing as a public health tool (1, 2). However, there remain significant gaps in understanding the inherent biology of sewage conveyance and its potential influence on monitoring efforts. To aid the characterization of wastewater microorganisms, 46 raw wastewater treatment plant (WWTP) influent samples underwent near-full-length 16S rRNA gene sequencing. We selected samples that, according to previous work, encompass microbial community variability across geographic and seasonal gradients (3, 4). Raw influent (25-mL) samples were filtered onto 0.2-μm mixed cellulose ester filters (product number WHA10401770; MilliporeSigma), from which DNA was extracted with the FastDNA Spin kit for soil (product number 116560200-CF; MP Biomedicals) as described previously (3, 4). Genes were amplified using KAPA HiFi HotStart ReadyMix (product number KK2602; Roche) with the primers 27F (5′-AGRGTTYGATYMTGGCTCAG-3′) and 1492R (5′-RGYTACCTTGTTACGACTT) under the following thermocycler conditions: 95°C for 5 min; 20 cycles of 98°C for 20 s, 55°C for 45 s, and 72°C for 3 min; and 72°C for 5 min. Each primer contained a pad sequence (GGTAG) followed by a unique 16-bp barcode appended to the 5′ end. Prior to PCR, the barcoded primers were phosphorylated with a T4 polynucleotide kinase (product number M0201S; New England Biolabs) and ATP (product number P0756S; New England Biolabs). Following PCR, amplicons were equimolarly pooled and purified with AMPure PB beads (product number 100-265-900; Pacific Biosciences [PacBio]). Libraries were created using the SMRTbell Express Template 2.0 (product number 101-685-400; PacBio) following the manufacturer’s protocol. Amplicons were enzymatically repaired and ligated to a PacBio adapter to form the SMRTbell template. Templates were sequenced on a Sequel II system using sequencing primer v.4 (product number 101-359-000; PacBio) and the Sequel II 2.1 binding kit (product number 101-820-500; PacBio). The University of Wisconsin-Milwaukee Great Lakes Genomics Center (Research Resource Identifier [RRID] SCR_017838) provided PacBio sequencing services. Default parameters were used for all software unless otherwise specified. BAM files from the PacBio Sequel II system were converted to FASTQ files with BEDtools v.2.30.0 (5). SeqKit v.2.2.0 (6) was used to demultiplex FASTQ files into individual files according to their unique barcodes. Primers were removed from demultiplexed files with Cutadapt (7). Following a PacBio-specific protocol, DADA2 v.1.16 (8) on Galaxy v.22.01 (9) was used to quality filter (maximum N = 0, maximum EE = 2), correct errors, and assign taxonomy with SILVA v.138 (10) as a reference database. For most reads, the first 10 primer bases on the 3′ end of the read were not trimmed by Cutadapt. These bases were removed with an exact-match approach (grep/cut). The resulting amplicon sequence variants (ASVs) were clustered to operational taxonomic units (OTUs) at 99.5% similarity with mothur v.1.43.0 (11) and its protocol (https://mothur.org/wiki/cluster). Before demultiplexing, the raw FASTQ file had 7,750,870 reads, which were condensed to 1,041 ASVs and 698 OTUs. A summary of raw, ASV, and OTU data is presented in Table 1. ASVs ranged from 1,383 to 1,553 bp, with a mean length of 1,455 bp. All ASVs were classified as bacteria and included 22 phyla, 35 classes, 71 orders, 116 families, 190 genera, and 158 species. See Fig. 1 for the most abundant OTUs. The improved taxonomic resolution from full-length gene sequences resulted in 643 ASVs (61.8%) classified to the species level, compared to 3.48% in a V4-V5 hypervariable region study of a similar sample set (4).
TABLE 1

Summary of demultiplexed sequencing data (BioProject accession number PRJNA809416)

BioSample accession no.SRA accession no.State of sample collectionDate (yr-mo-day)No. of raw readsNo. of ASVsNo. of OTUs
SAMN26027580 SRR18111974 Montana2013-1-168,1242925
SAMN26027581 SRR18111973 Oregon2013-1-1650,4756638
SAMN26027582 SRR18111962 Washington2013-1-1539,8578663
SAMN26027583 SRR18111951 Iowa2013-1-1548,1027357
SAMN26027584 SRR18111940 Nebraska2013-1-2349,5106339
SAMN26027585 SRR18111933 Wisconsin2013-1-2761,3229058
SAMN26027586 SRR18111932 Alaska2013-1-2344,5058965
SAMN26027587 SRR18111931 Wyoming2013-1-2440,1513721
SAMN26027588 SRR18111930 Colorado2013-1-2363,2407444
SAMN26027589 SRR18111929 Texas2012-8-3050,78711698
SAMN26027590 SRR18111972 Alabama2012-8-1438,9607055
SAMN26027591 SRR18111971 Georgia2012-8-1648,6287860
SAMN26027592 SRR18111970 California2012-8-1443,5047859
SAMN26027593 SRR18111969 Florida2012-8-842,9938778
SAMN26027594 SRR18111968 Tennessee2012-8-1540,1646650
SAMN26027595 SRR18111967 Texas2012-8-1540,2206249
SAMN26027596 SRR18111966 Arizona2012-8-1539,1406251
SAMN26027597 SRR18111965 California2012-8-2148,4235936
SAMN26027598 SRR18111964 Florida2012-8-2143,786151109
SAMN26027599 SRR18111963 Hawaii2012-9-743,5095742
SAMN26027600 SRR18111961 Minnesota2013-1-1652,7095834
SAMN26027601 SRR18111960 Ohio2013-1-1736,9805540
SAMN26027602 SRR18111959 Wisconsin2016-4-737,9558361
SAMN26027603 SRR18111958 Wisconsin2017-4-344,27211684
SAMN26027604 SRR18111957 Wisconsin2016-8-337,1415241
SAMN26027605 SRR18111956 Wisconsin2017-8-2266,2788452
SAMN26027606 SRR18111955 Wisconsin2016-12-753,4716643
SAMN26027607 SRR18111954 Wisconsin2017-12-140,7787149
SAMN26027608 SRR18111953 Wisconsin2016-2-899,7679561
SAMN26027609 SRR18111952 Wisconsin2017-2-642,1166545
SAMN26027610 SRR18111950 Wisconsin2016-1-672,96511074
SAMN26027611 SRR18111949 Wisconsin2017-1-532,0357153
SAMN26027612 SRR18111948 Wisconsin2016-7-1846,5485540
SAMN26027613 SRR18111947 Wisconsin2017-7-1252,1438156
SAMN26027614 SRR18111946 Wisconsin2016-6-848,8615541
SAMN26027615 SRR18111945 Wisconsin2017-6-745,1767246
SAMN26027616 SRR18111944 Wisconsin2016-3-229,7994728
SAMN26027617 SRR18111943 Wisconsin2017-3-160,0416940
SAMN26027618 SRR18111942 Wisconsin2016-5-240,2538364
SAMN26027619 SRR18111941 Wisconsin2017-5-19,8384638
SAMN26027620 SRR18111939 Wisconsin2016-11-338,7025839
SAMN26027621 SRR18111938 Wisconsin2017-11-249,6937145
SAMN26027622 SRR18111937 Wisconsin2016-10-550,9087155
SAMN26027623 SRR18111936 Wisconsin2017-10-447,9855937
SAMN26027624 SRR18111935 Wisconsin2016-9-2136,1165949
SAMN26027625 SRR18111934 Wisconsin2017-9-2641,0138059
FIG 1

Genus and species assignments of abundant OTUs. The top 5 most abundant OTUs in each wastewater sample set (north and south United States and winter, spring, summer, and fall in Milwaukee, Wisconsin) were identified, comprising 64.0% of all sequences. Bar height indicates the proportion of that OTU among the common OTUs analyzed. Bar colors denote genus and species assignments of OTUs.

Genus and species assignments of abundant OTUs. The top 5 most abundant OTUs in each wastewater sample set (north and south United States and winter, spring, summer, and fall in Milwaukee, Wisconsin) were identified, comprising 64.0% of all sequences. Bar height indicates the proportion of that OTU among the common OTUs analyzed. Bar colors denote genus and species assignments of OTUs. Summary of demultiplexed sequencing data (BioProject accession number PRJNA809416)

Data availability.

Demultiplexed FASTQ files can be found in the NCBI Sequence Read Archive (SRA) under BioProject accession number PRJNA809416. Annotated files, additional analyses, and code are available at GitHub (https://github.com/NewtonLabUWM/Full16S_sewageDatabase).
  10 in total

1.  Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities.

Authors:  Patrick D Schloss; Sarah L Westcott; Thomas Ryabin; Justine R Hall; Martin Hartmann; Emily B Hollister; Ryan A Lesniewski; Brian B Oakley; Donovan H Parks; Courtney J Robinson; Jason W Sahl; Blaz Stres; Gerhard G Thallinger; David J Van Horn; Carolyn F Weber
Journal:  Appl Environ Microbiol       Date:  2009-10-02       Impact factor: 4.792

2.  Urban wastewater bacterial communities assemble into seasonal steady states.

Authors:  Emily Lou LaMartina; Aurash A Mohaimani; Ryan J Newton
Journal:  Microbiome       Date:  2021-05-20       Impact factor: 14.650

3.  SeqKit: A Cross-Platform and Ultrafast Toolkit for FASTA/Q File Manipulation.

Authors:  Wei Shen; Shuai Le; Yan Li; Fuquan Hu
Journal:  PLoS One       Date:  2016-10-05       Impact factor: 3.240

4.  BEDTools: a flexible suite of utilities for comparing genomic features.

Authors:  Aaron R Quinlan; Ira M Hall
Journal:  Bioinformatics       Date:  2010-01-28       Impact factor: 6.937

5.  DADA2: High-resolution sample inference from Illumina amplicon data.

Authors:  Benjamin J Callahan; Paul J McMurdie; Michael J Rosen; Andrew W Han; Amy Jo A Johnson; Susan P Holmes
Journal:  Nat Methods       Date:  2016-05-23       Impact factor: 28.547

6.  Sewage reflects the microbiomes of human populations.

Authors:  Ryan J Newton; Sandra L McLellan; Deborah K Dila; Joseph H Vineis; Hilary G Morrison; A Murat Eren; Mitchell L Sogin
Journal:  MBio       Date:  2015-02-24       Impact factor: 7.867

7.  The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update.

Authors:  Enis Afgan; Dannon Baker; Bérénice Batut; Marius van den Beek; Dave Bouvier; Martin Cech; John Chilton; Dave Clements; Nate Coraor; Björn A Grüning; Aysam Guerler; Jennifer Hillman-Jackson; Saskia Hiltemann; Vahid Jalili; Helena Rasche; Nicola Soranzo; Jeremy Goecks; James Taylor; Anton Nekrutenko; Daniel Blankenberg
Journal:  Nucleic Acids Res       Date:  2018-07-02       Impact factor: 16.971

8.  The SILVA ribosomal RNA gene database project: improved data processing and web-based tools.

Authors:  Christian Quast; Elmar Pruesse; Pelin Yilmaz; Jan Gerken; Timmy Schweer; Pablo Yarza; Jörg Peplies; Frank Oliver Glöckner
Journal:  Nucleic Acids Res       Date:  2012-11-28       Impact factor: 16.971

9.  Wastewater-Based Epidemiology: Global Collaborative to Maximize Contributions in the Fight Against COVID-19.

Authors:  Aaron Bivins; Devin North; Arslan Ahmad; Warish Ahmed; Eric Alm; Frederic Been; Prosun Bhattacharya; Lubertus Bijlsma; Alexandria B Boehm; Joe Brown; Gianluigi Buttiglieri; Vincenza Calabro; Annalaura Carducci; Sara Castiglioni; Zeynep Cetecioglu Gurol; Sudip Chakraborty; Federico Costa; Stefano Curcio; Francis L de Los Reyes; Jeseth Delgado Vela; Kata Farkas; Xavier Fernandez-Casi; Charles Gerba; Daniel Gerrity; Rosina Girones; Raul Gonzalez; Eiji Haramoto; Angela Harris; Patricia A Holden; Md Tahmidul Islam; Davey L Jones; Barbara Kasprzyk-Hordern; Masaaki Kitajima; Nadine Kotlarz; Manish Kumar; Keisuke Kuroda; Giuseppina La Rosa; Francesca Malpei; Mariana Mautus; Sandra L McLellan; Gertjan Medema; John Scott Meschke; Jochen Mueller; Ryan J Newton; David Nilsson; Rachel T Noble; Alexander van Nuijs; Jordan Peccia; T Alex Perkins; Amy J Pickering; Joan Rose; Gloria Sanchez; Adam Smith; Lauren Stadler; Christine Stauber; Kevin Thomas; Tom van der Voorn; Krista Wigginton; Kevin Zhu; Kyle Bibby
Journal:  Environ Sci Technol       Date:  2020-06-12       Impact factor: 9.028

Review 10.  Future perspectives of wastewater-based epidemiology: Monitoring infectious disease spread and resistance to the community level.

Authors:  Natalie Sims; Barbara Kasprzyk-Hordern
Journal:  Environ Int       Date:  2020-04-04       Impact factor: 9.621

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.