| Literature DB >> 29087368 |
Alyse K Hawley1, Mónica Torres-Beltrán1, Elena Zaikova2, David A Walsh3, Andreas Mueller1, Melanie Scofield1, Sam Kheirandish1, Chris Payne4, Larysa Pakhomova4, Maya Bhatia1, Olena Shevchuk1, Esther A Gies5, Diane Fairley1, Stephanie A Malfatti6, Angela D Norbeck7, Heather M Brewer7, Ljiljana Pasa-Tolic7, Tijana Glavina Del Rio6, Curtis A Suttle1,4,8, Susannah Tringe6, Steven J Hallam1,9,10,11,12.
Abstract
Marine oxygen minimum zones (OMZs) are widespread regions of the ocean that are currently expanding due to global warming. While inhospitable to most metazoans, OMZs are hotspots for microbial mediated biogeochemical cycling of carbon, nitrogen and sulphur, contributing disproportionately to marine nitrogen loss and climate active trace gas production. Our current understanding of microbial community responses to OMZ expansion is limited by a lack of time-resolved data sets linking multi-omic sequence information (DNA, RNA, protein) to geochemical parameters and process rates. Here, we present six years of time-resolved multi-omic observations in Saanich Inlet, a seasonally anoxic fjord on the coast of Vancouver Island, British Columbia, Canada that undergoes recurring changes in water column oxygenation status. This compendium provides a unique multi-omic framework for studying microbial community responses to ocean deoxygenation along defined geochemical gradients in OMZ waters.Entities:
Year: 2017 PMID: 29087368 PMCID: PMC5663217 DOI: 10.1038/sdata.2017.160
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Figure 1Summary of multi-omic samples collected in Saanich Inlet time series.
(a) Oxygen concentration contour for CTD data (February 2008 onward)[35] indicating 16 sampling depths for water column geochemistry and high-resolution (HR) DNA samples for SSU libraries (small black dots) and six major depths for large volume (LV) samples for meta-genomics, -transcriptomics, -proteomics and LV SSU libraries (large black dots). (b) Sample inventory from February 2006 to October 2014 indicating multi-omic datasets included in this manuscript (solid black), in previous publications (gray) and accompanying datasets currently undergoing processing and analysis (open gray).
Summary of datasets, number of samples and sizes for SSU rRNA gene tag sequences.
|
|
|
|
| |
|---|---|---|---|---|
| LV PyroTags | 99 | 49,635 | 286,755 | 831 |
| LV iTags | 19 | 306,365 | 1051 | 355,883 |
| LV_PF iTags | 16 | 345,756 | 78,925 | 377,992 |
| HR PyroTag | 311 | 118,641 | 981,153 | 2752 |
| HR iTags | 47 | 315,487 | 1034 | 448,003 |
*200–540 bp for PyroTags and >130 bp for iTags.
Summary of datasets, number of samples and sizes for metagenome, metatranscriptome and metaproteome sequencing.
|
|
|
|
| |
|---|---|---|---|---|
| Metagenomes | 90 | 1.80E+08 | 3.43E+05 | 4.69E+05 |
| Metatranscriptomes | 62 | 4.65E+07 | 1.09E+05 | 1.22E+05 |
|
|
|
| ||
| Metaproteomes | 68 | 4.76E+03 | 5.81E+04 | |
Key to data files in Supplementary Table 1 SSU rRNA gene tag inventory.
|
|
|
|---|---|
|
| Identifier of unique time-series time point and depth in
which seawater sample for dataset was obtained, links to geochemical
time series data (Torres Beltrán |
|
| Numerical identifier of individual cruises |
|
| Year of cruise |
|
| Month of cruise |
|
| Indicates the sampling station from which seawater sample was obtained |
|
| Depth at which seawater sample was obtained |
|
| NCBI BioSample ID for PyrtoTag (V6-V8 region) sequenced
pre-filtered samples (2.7–0.22 μm fraction)
on NCBI website ( |
|
| NCBI BioSample ID for iTag (V4-V5 region) sequenced
pre-filtered samples (2.7–0.22 μm fraction)
on NCBI website ( |
|
| NCBI BioSample ID for PytoTag (V6-V8 region) sequenced
pre-filter samples (>2.7 μm fraction) on
NCBI ( |
|
| NCBI BioSample ID for iTag (V6-V8 region) sequenced
non-pre-filtered samples (>0.22 μm fraction)
on NCBI website ( |
|
| Sequencing centre for HR PyroTag samples. JGI denotes Joing Genome Institute, GQ denotes Genome Quebec. All other sequencing was carried out at JGI. |
|
| NCBI BioSample ID for iTag (V4-V5 region) sequenced
non-pre-filtered samples (>0.22 μm fraction)
on NCBI website ( |
Key to the data fields in the Supplementary Table 2: Metagenomes (MetaG), Metatranscriptomes (MetaT), and Metaproteomes (MetaP) inventory.
|
|
|
|---|---|
|
| Identifier of unique time-series time point and depth in
which seawater sample for dataset was obtained, links to geochemical
time series data (Torres Beltrán |
|
| Numerical identifier of individual cruises |
|
| Year of cruise |
|
| Month of cruise |
|
| Indicates the sampling station from which seawater sample was obtained |
|
| Depth at which seawater sample was obtained |
|
| Indicates if SSU rRNA tag data exists for that sample and
what type of tag (see |
|
| JGI Project ID for the IMG/M website ( |
|
| NCBI BioSample ID for metatranscriptome at NCBI website
( |
|
| JGI Project ID for the IMG/M website ( |
|
| NCBI BioSample ID for metatranscriptome at NCBI website
( |
|
| File name prefix in PRIDE database website ( |
Key to files in PRIDE metaproteome repository PDX004433.
|
|
|
|---|---|
|
| Parameter and settings files used for the database search of spectra to peptide |
|
| De-isotoped values of mass, observed charged states, and chromatographic elution times from the mass spectrometry runs |
|
| Mass spectrometry run files, in original format |
|
| Amino Acid sequence file for all detected proteins from all Saanich Inlet metaproteome samples. |
|
| Tabular lists of identified peptides, associated confidence scores, and protein reference names |
|
| Protein lists from all samples, including redundant peptide to protein matches |
|
| FASTA with duplicate sequences removed, cleaned and trimmed for use with the search engine |
Figure 2Data Validation figures for SSU rRNA tag sequencing and metagenomes.
(a) 454 PyroTags for small subunit rRNA gene showing number of raw reads versus read length for large volume samples (99 samples in total) (left) and high resolution samples (311 samples in total) (right). (b) Metagenomic assemblies for two samples from different depths showing average fold coverage versus contig length and percentage GC versus average fold coverage for contigs.
Figure 3Data validation figures for metatranscriptomes and metaproteomes.
(a) Metatranscriptomic reads for two samples from different depths showing distribution of reads over read quality (left) and percentage GC (right). (b) Metaproteome showing number of detected peptides (top) and detected proteins (bottom) for each depth sampled, colour coded by cruise ID. Higher number of detected proteins than peptides is due the sequence redundancy in the metagenomic database used to identify peptides.