| Literature DB >> 33863919 |
Alyse A Larkin1, Catherine A Garcia1, Nathan Garcia1, Melissa L Brock2, Jenna A Lee1, Lucas J Ustick2, Leticia Barbero3,4, Brendan R Carter5,6, Rolf E Sonnerup5,6, Lynne D Talley7, Glen A Tarran8, Denis L Volkov3,4, Adam C Martiny9,10.
Abstract
Detailed descriptions of microbial communities have lagged far behind physical and chemical measurements in the marine environment. Here, we present 971 globally distributed surface ocean metagenomes collected at high spatio-temporal resolution. Our low-cost metagenomic sequencing protocol produced 3.65 terabases of data, where the median number of base pairs per sample was 3.41 billion. The median distance between sampling stations was 26 km. The metagenomic libraries described here were collected as a part of a biological initiative for the Global Ocean Ship-based Hydrographic Investigations Program, or "Bio-GO-SHIP." One of the primary aims of GO-SHIP is to produce high spatial and vertical resolution measurements of key state variables to directly quantify climate change impacts on ocean environments. By similarly collecting marine metagenomes at high spatiotemporal resolution, we expect that this dataset will help answer questions about the link between microbial communities and biogeochemical fluxes in a changing ocean.Entities:
Mesh:
Year: 2021 PMID: 33863919 PMCID: PMC8052323 DOI: 10.1038/s41597-021-00889-9
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Fig. 1Distribution of global surface microbial metagenomes from Bio-GO-SHIP (circles) in comparison to Tara Oceans (squares) and bioGEOTRACES (ovals). Symbol colours match the corresponding cruise name label colour.
Sampling protocols and read counts for global Bio-GO-SHIP surface ocean metagenomes.
| Cruise/Year | DNA Collection | DNA Volume | Station Count | Total Reads | Total Bases | Median Bases Per Sample | Range of Bases Per Sample | Total Pre-Filter Reads ≥ Q25 |
|---|---|---|---|---|---|---|---|---|
| I07N 2018 | Underway, CTD | 4 L, 2-4 L | 248 | 6.20 × 109 | 9.36 × 1011 | 3.27 × 109 | 2.47 × 108–1.42 × 1010 | 4.65 × 109 |
| I09N 2016 | Underway | 10 L | 242 | 5.73 × 109 | 8.64 × 1011 | 3.10 × 109 | 4.71 × 108–1.22 × 1010 | 4.15 × 109 |
| C13.5 2020 | Underway | 5-10 L | 229 | 5.94 × 109 | 8.96 × 1011 | 2.94 × 109 | 3.98 × 108–2.14 × 1010 | 4.17 × 109 |
| P18 2016-17 | CTD | 2 L | 104 | 3.22 × 109 | 4.86 × 1011 | 4.46 × 109 | 6.14 × 107–1.77 × 1010 | 2.55 × 109 |
| AMT-28 2018 | CTD | 2 L | 63 | 2.18 × 109 | 3.29 × 1011 | 4.95 × 109 | 1.62 × 109–1.22 × 1010 | 1.71 × 109 |
| NH1418 2014 | CTD | 2 L | 23 | 5.41 × 108 | 8.17 × 1010 | 3.03 × 109 | 2.42 × 109–1.08 × 1010 | 4.02 × 108 |
| AE1319 2013 | CTD | 2 L | 13 | 2.01 × 108 | 3.03 × 1010 | 4.69 × 109 | 2.15 × 109–7.43 × 109 | 1.63 × 108 |
| BVAL46 2011 | CTD | 2 L | 12 | 2.01 × 108 | 3.04 × 1010 | 2.73 × 109 | 2.33 × 109–4.88 × 109 | 1.61 × 108 |
Publicly available metadata variables collected on Bio-GO-SHIP cruises.
| Campaign | Data Host and License | DOI | Metadata Variables |
|---|---|---|---|
| I07N, GO-SHIP | CCDHO; PDM | 10.7942/C25H2B | Temperature, Salinity, Dissolved O2, Nutrients (NO3, NO2, PO4, SiO4), Chlorofluorocarbons (CFCs) /SF6, Dissolved Inorganic Carbon, Dissolved Organic Carbon, Total pH, Total Alkalinity, Stable gases (N2O), Calcium |
| I09N, GO-SHIP | CCDHO; PDM | 10.7942/C2008W | Temperature, Salinity, Dissolved O2, Nutrients (NO3, NO2, NH4, PO4, SiO4), Chlorofluorocarbons (CFCs) /SF6, 13C and 14C of DIC, Dissolved Inorganic Carbon, Dissolved Organic Carbon, Total pH, Total Alkalinity, Stable gases (N2, N2O, Ar), 18O, Chromophoric Dissolved Organic Matter (CDOM), Pigment HPLC, Chlorophyll A, Dissolved/ particulate/ cellular P and Fe, N P and Fe uptake rates |
| C13.5*/A13.5, GO-SHIP | CCDHO; PDM | 10.7942/C2894Zs | Temperature, Salinity, Dissolved O2, Chlorophyll fluorometer and scattering |
| P18, GO-SHIP | CCDHO; PDM | 10.7942/C21T0F | Temperature, Salinity, Dissolved O2, Nutrients (NO3, NO2, PO4, SiO4), Chlorofluorocarbons (CFCs) /SF6, Dissolved Inorganic Carbon (DIC), Total pH, Total Alkalinity |
AMT-28, PML AMT SOCCOM, NSF | BODC, NERC Open Government License SOCCOM, PDM | https://doi.org/10/fqkd | Temperature, Salinity, Dissolved O2, Density, Fluorescence, PAR Irradiance, |
| NH1418, NSF | BCO-DMO, WHOAS; CC BY 4.0 | 10.26008/1912/bco-dmo.829895.1 | Temperature, Salinity, Dissolved O2 / Saturation, Density, Chlorophyll a, PAR irradiance, Fluorescence, Nutrients (NO3 + NO2, NO2), Soluble Reactive Phosphorus (SRP), Particulate Organic C N and P, |
| AE1319, NSF | BCO-DMO, WHOAS; CC BY 4.0 | 10.26008/1912/bco-dmo.829797.1 10.26008/1912/bco-dmo.538091.2 | Temperature, Salinity, Dissolved O2, PAR irradiance, Chlorophyll a, Nutrients (NO3 + NO2, PO4, SiO4), Soluble Reactive Phosphorus (SRP), Particulate Organic C N and P, |
| BVAL46, NSF, BATS | BCO-DMO, WHOAS; CC BY 4.0 | 10.26008/1912/bco-dmo.829843.1 10.26008/1912/bco-dmo.538091.2 | Temperature, Salinity, Dissolved O2, Chlorophyll a, Nutrients (NO3 + NO2, PO4, SiO4), Soluble Reactive Phosphorus (SRP), Particulate Organic P, |
These data may be updated as additional samples or stations are processed by the principal investigators of each dataset. Another 48 metadata variables not listed here were collected aboard the GO-SHIP, PML AMT, and NSF cruises and may be available upon request from CCDHO, BODC, or SOCCOM.
*C13.5 is a partial occupation of the A13.5 GO-SHIP line that was aborted due to COVID-19. Thus, CTD casts corresponding to DNA collection were only performed at 8 stations.
Fig. 2Comparison of the distance between stations, station latitudes, and station longitudes for global surface ocean metagenomes. Individual station locations from (A) Bio-GO-SHIP, (B) bioGEOTRACES and (C) Tara Oceans were examined. Plots are labelled with the median value, M. Station distance was calculated as the distance to the nearest station.
Sequencing run breakdown of Bio-GO-SHIP metagenomes including technical validation statistics.
| Run | Cruises | Illumina Platform | Sample Count | Library Concentration (nM) | Fragments 200–600 bp |
|---|---|---|---|---|---|
| 1 | I09N | HiSeq | 24 | 4.1* | 84.02%* |
| 2 | I09N, AE1319, BVAL46, NH1418 | HiSeq | 30 | 16.52 | 98.85% |
| 3 | I09N | NovaSeq | 215 | 16.29 | 86.34% |
| 4 | P18, AMT-28, AE1319, BVAL46, NH1418 | NovaSeq | 203 | 35.64 | 93.87% |
| 5 | I07N | NovaSeq | 248 | 32.25 | 91.72% |
| 6 | C13.5 | NovaSeq | 251 | 23.15 | 87.51% |
*Run 1 was concentrated via SpeedVac to 15 nM and bead size-selected such that 90% of fragments were between 200–600 bp by the UC Davis Genome Center DNA Technologies Core prior to sequencing. Final values for this run are not available.
| Measurement(s) | DNA sequencing • temperature of water • concentration of phosphate in water • concentration of nitrogen atom in water |
| Technology Type(s) | Illumina sequencing • watercraft • continuous flow autoanalyzer |
| Sample Characteristic - Organism | marine metagenome |
| Sample Characteristic - Environment | ocean |
| Sample Characteristic - Location | global |