| Literature DB >> 26203332 |
Petra Ten Hoopen1, Stéphane Pesant2, Renzo Kottmann3, Anna Kopf4, Mesude Bicak5, Simon Claus6, Klaas Deneudt6, Catherine Borremans7, Peter Thijsse8, Stefanie Dekeyzer6, Dick Ma Schaap8, Chris Bowler9, Frank Oliver Glöckner4, Guy Cochrane1.
Abstract
Contextual data collected concurrently with molecular samples are critical to the use of metagenomics in the fields of marine biodiversity, bioinformatics and biotechnology. We present here Marine Microbial Biodiversity, Bioinformatics and Biotechnology (M2B3) standards for "Reporting" and "Serving" data. The M2B3 Reporting Standard (1) describes minimal mandatory and recommended contextual information for a marine microbial sample obtained in the epipelagic zone, (2) includes meaningful information for researchers in the oceanographic, biodiversity and molecular disciplines, and (3) can easily be adopted by any marine laboratory with minimum sampling resources. The M2B3 Service Standard defines a software interface through which these data can be discovered and explored in data repositories. The M2B3 Standards were developed by the European project Micro B3, funded under 7(th) Framework Programme "Ocean of Tomorrow", and were first used with the Ocean Sampling Day initiative. We believe that these standards have value in broader marine science.Entities:
Keywords: Biodiversity; Bioinformatics; Data standard; Interoperability; Marine; Microbial; Molecular; Reporting
Year: 2015 PMID: 26203332 PMCID: PMC4511511 DOI: 10.1186/s40793-015-0001-5
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
M2B3 Reporting Standard about an investigation effort
| INVESTIGATION_ | Refers to a sampling activity that is either determined in time, repeated in time or continuous, e.g. a cruise, a mesocosm experiment, a time series, or live data streams | Free text | Micro B3-OSD2014 | |
| INVESTIGATION _ | Refers to the unique identifier and name of the site/station where the sampling activity is performed. | Format: <Site ID from OSD Site Registry >, <Site name from OSD Site Registry> | OSD5, Poseidon-E1-M3A Time Series Station | |
| INVESTIGATION _ | Refers to the specific unique stage from which the sampling device was deployed; includes the platform category and platform name. | Format: <Platform category from SDN:L06>,<Platform name> | research vessel, FILIA | |
| INVESTIGATION _Authors | List of people who will appear in the citation of data publications. Please order the list according to authorship. The first author is the contact person. | Format: <LASTNAME>, <FirstName>, <Institution>, <email> | JONES, Peter, Institute1, pjones@institute1.eu; SMITH, Mary, Institute2, msmith@institute2.eu | |
| INVESTIGATION _Project | Refers to the project that organised/funded the data/sample collection. | Free text | Micro B3 | |
| INVESTIGATION _Objective | Describes the scientific context/interest of the sampling activity. This information is useful to generate a short abstract as part of the data set citation. | Free text; | A short abstract | |
| 100-500 words |
Mandatory information is in bold and other fields are recommended OSD Sites Registry is a controlled register for OSD sampling Sites (http://mb3is.megx.net/osd-registry). SDN:L06::XX is a controlled terms list describing “CATEGORIES” of platforms (http://seadatanet.maris2.nl/v_bodc_vocab_v2/search.asp?lib=L06).
M2B3 Reporting Standard about a sampling event
| EVENT_ | Date and time when the sampling event started and ended, e.g. each CTD cast, net tow, or bucket collection is a distinct event. | Date and time in UTC; | 2013-06-21T14:05:00Z/ | |
| Format: yyyy-mm-ddThh:mm:ssZ | 2013-06-21T14:46:00Z | |||
| EVENT_ | Longitude of the location where the sampling event started and ended, e.g. each CTD cast, net tow, or bucket collection is a distinct event. | Format: ###.###### | 035.666666 | |
| Decimal degrees; East = +, West = -Format: Use WGS 84 for GPS data | 035.670200 | |||
| EVENT_ | Latitude of the location where the sampling event started and ended, e.g. each CTD cast, net tow, or bucket collection is a distinct event. | Format: ##.###### | −24.666666 | |
| Decimal degrees; North = +, South = -Format: Use WGS 84 for GPS data | -24.664300 | |||
| EVENT_Device | Refers to the instrument/gear used to collect the sample or the sensor used to measure environmental parameters. | Free text | 10L-Niskins or 5L-Bucket | |
| EVENT_Method | Refers to the standard deployment procedure of the Device. | Free text | 12 Niskins were deployed on a Rosette | |
| EVENT_Comment | Report any observation/deviation from the standard deployment procedure described in EVENT_Method | Free text | Lots of Jellyfish in the water |
Mandatory information is in bold and other fields are recommended.
M2B3 Reporting Standard about a sample
| SAMPLE_ | A short informative description of the sample. Must be unique for each sample, (i.e. for each filter generated during sampling). | Format: <OSD_SiteID > _ < Month > _ < Year > _ < SiteName > _ < Protocol_Label > _ < SampleNo > _ < Depth> | OSD3_06_14_Helgoland_NPL022_1_surface | |
| SAMPLE_ | The distance below the surface of the water at which a measurement was made or a sample was collected. | Format: ##.#; | 1.5 | |
| Positive below the sea surface. | ||||
| SDN:P06:46:ULAA for m | ||||
| SAMPLE_ | Identifies the protocol used to produce the sample, e.g. filtration and preservation. | Term list; | NPL022 | |
| See the | ||||
| SAMPLE_Quantity | Refers to the quantity of environment that was sampled, most often with dimensions Length, Amount, Mass or Time. | Format : ###.### | 20 Litres | |
| See the | ||||
| SAMPLE_Container | Refers to the container in which the sample is stored prior to analysis. | Term list; | Sterivex cartridge | |
| See the | ||||
| SAMPLE_Content | Refers to the content of the sample container. While the sample might target a specific organism (e.g. bacteria), the sample content might be a filter or a volume of water. | Term list; | Particulate matter on a 0.22 μm pore size filter | |
| See the | ||||
| SAMPLE_Size-Fraction_Upper-Threshold | Refers to the mesh/pore size used to pre-filter/pre-sort the sample. Materials larger than the size threshold are excluded from the sample. | Term list; | no pre-filtration | |
| See the | ||||
| SAMPLE_Size-Fraction_Lower-Threshold | Refers to the mesh/pore size used to retain the sample. Materials smaller than the size threshold are exclude from the sample. | Term list; | 0.22 | |
| See the | ||||
| SAMPLE_Treatment_Chemicals | Refers to the chemicals (e.g. preservatives) added to the sample. | Terms list: ChEBI; | None | |
| See the | ||||
| SAMPLE_Treatment_Storage | Refers to the conditions in which the sample is stored, e.g. temperature, light conditions, time. | Term list; | −80 degrees Celsius | |
| See the |
Mandatory information is in bold and other fields are recommended. OSD Protocols are available at http://www.microb3.eu/sites/default/files/osd/OSD_Handbook_v2.0.pdf. ChEBI is an ontological classification and dictionary of small chemical compounds (http://www.ebi.ac.uk/chebi/init.do).
M2B3 Reporting Standard about the sample environmental context
| Descriptor of the broad ecological context of a sample. | Terms list: EnvO | ENVO:01000023 for “marine pelagic biome” | |
| ENVIRONMENT_ | Compared to biome, feature is a descriptor of a geographic aspect or a physical entity that strongly influences the more local environment of a sample. | Terms list: EnvO | ENVO:00000209 for “photic zone” |
| ENVIRONMENT_ | Descriptor of the material that was displaced by the sampling activity, or material in which a sample was embedded, prior to the sampling event. | Terms list: EnvO | ENVO:00002149 for “sea water” |
| ENVIRONMENT_ | Temperature of water at the time of taking the sample. Define the parameter according to Table | Format: ##.# | 16.2°C |
| SDN:P02:75:TEMP | |||
| SDN:P06:46:UPAA for°C | |||
| ENVIRONMENT_ | Salinity of water at the time of taking the sample. Define the measurement according to Table | Format: ##.# | 39.1 psu |
| SDN:P02:75:PSAL | |||
| SDN:P06:46:UGKG for PSU | |||
| ENVIRONMENT_Marine_Region | It characterises the environment, based on the latitude and longitude, by reference to geographic, political, economic or ecological boundaries. | Terms list: Marine Regions | MRGID:21886 for Marine Ecoregion:South European Atlantic Shelf |
| ENVIRONMENT_Other_Parameters | Add as many fields as there are other environments parameters measured. | ||
| Define the measurement according to Table | |||
| See the list of recommended environmental parameters in Table | |||
Mandatory information is in bold and other fields are recommended EnvO is the Environment Ontology (http://www.environmentontology.org/Browse-EnvO). SDN:P02:75:XXXX is a controlled terms list describing “WHAT” is measured (http://seadatanet.maris2.nl/v_bodc_vocab_v2/search.asp?lib=P02). SDN:P06:46:XXXX is a controlled terms list describing “UNITS” of measurements (http://seadatanet.maris2.nl/v_bodc_vocab_v2/search.asp?lib=P06). Marine Regions is a standard list of marine georeferenced place names (http://www.marineregions.org/).
M2B3 Reporting Standard about environmental measurements
| General | Conductivity | Electrical conductivity of water | SDN:P02:75:CNDC | |
| SDN:P06:46:UECA for mS/cm | ||||
| Temperature of water | SDN:P02:75:TEMP | |||
| SDN:P06:46:UPAA for °C | ||||
| Depth (m) | Vertical spatial coordinates | SDN:P02:75:AHGT | ||
| SDN:P06:46:ULAA for m | ||||
| Salinity of water | SDN:P02:75:PSAL | |||
| SDN:P06:46:UGKG for PSU | ||||
| Fluorescence | Raw (volts) or converted (mg Chla/m^3) fluorescence of the water | SDN:P02:75:FVLT | ||
| SDN:P06:46:UVLT for volts | ||||
| Nutrient status of a system | Nitrate | Nitrate concentration parameters in the water column | SDN:P02:75:NTRA | |
| SDN:P06:46:UPOX for μmol/L | ||||
| Nitrite | Nitrite concentration parameters in the water column | SDN:P02:75:NTRI | ||
| SDN:P06:46:UPOX for μmol/L | ||||
| Phosphate | Phosphate concentration parameters in the water column | SDN:P02:75:PHOS | ||
| SDN:P06:46:UPOX for μmol/L | ||||
| Silicate | Silicate concentration parameters in the water column | SDN:P02:75:SLCA | ||
| SDN:P06:46:UPOX for μmol/L | ||||
| Ammonium | Ammonium concentration parameters in the water column | SDN:P02:75:AMON | ||
| SDN:P06:46:UPOX for μmol/L | ||||
| Chemical properties of a system | pH | Alkalinity, acidity and pH of the water column | SDN:P02:75:ALKY | |
| Dissolved oxygen concentration | Dissolved oxygen parameters in the water column | SDN:P02:75:DOXY | ||
| SDN:P06:46:KGUM for μmol/kg | ||||
| Optical properties of a system | Downward PAR | Visible waveband radiance and irradiance measurements in the water column | SDN:P02:75:VSRW | |
| SDN:P06:46:UMES for μE/m^2/s | ||||
| Turbidity | Transmittance and attenuance of the water column | SDN:P02:75:ATTN | ||
| SDN:P06:46:USTU for FTU or NTU | ||||
| Biogeochemistry (Amount or Mass) | Carbon organic particulate (POC) | Particulate organic carbon concentration in the water column | SDN:P02:75:CORG | |
| SDN:P06:46:UGPL for μg/L | ||||
| Nitrogen organic particulate (PON) | Particulate organic nitrogen concentration in the water column | SDN:P02:75:NTOT | ||
| SDN:P06:46:UGPL for μg/L | ||||
| Carbon organic dissolved (DOC) | Dissolved organic carbon concentration in the water column | SDN:P02:75:DOCC | ||
| SDN:P06:46:UPOX for μmol/L | ||||
| Nitrogen organic dissolved (DON) | Dissolved organic nitrogen concentration in the water column | SDN:P02:75:TDNT | ||
| SDN:P06:46:UMGL for mg/L | ||||
| Ecosystem trophic structure & biodiversity (Amount, Volume or Mass of organisms in the environment) | Pigment concentrations | Concentration of pigments (e.g. chlorophyll a) extracted and analysed by fluorometry or HPLC | SDN:P02:75:CPWC | |
| SDN:P06:46:UMMC for mg/m^3 | ||||
| Picoplankton (Flow Cytometry) | Abundance of cells in the water column (+other avail. cell properties) | SDN:P02:75:BATX | ||
| SDN:P06:46:UPMM for #/m^3 | ||||
| Nano/Microplankton | Abundance of cells in the water column (+other avail. cell properties) | SDN:P02:75:MATX or PATX | ||
| SDN:P06:46:UPMM for #/m^3 | ||||
| Meso/Macroplankton | Abundance of individuals in the water column (+other avail. properties) | SDN:P02:75:ZATX | ||
| SDN:P06:46:UPMM for #/m^3 | ||||
| Ecosystem trophic rates | Primary Production (isotope uptake) | Primary Production in the water column | SDN:P02:75:PPRD | |
| SDN:P06:46:UGDC for mg/m^3/d | ||||
| Primary Production (oxygen) | Primary Production in the water column | SDN:P02:75:PPRD | ||
| SDN:P06:46:UGDC for mg/m^3/d | ||||
| Bacterial production (isotope uptake) | Bacterial production in the water column | SDN:P02:75:UPTH | ||
| SDN:P06:46:UGDC for mg/m^3/d | ||||
| Bacterial production (respiration) | Bacterial production in the water column | SDN:P02:75:UPTH | ||
| SDN:P06:46:UGDC for mg/m^3/d | ||||
Mandatory information is in bold and other fields are recommended.
M2B3 Reporting Standard about organisms in a sample
| ORGANISM_ | An identifier for the nomenclatural (not taxonomic) details of a scientific name. | Terms list: WoRMS | urn:lsid:marinespecies.org:taxname: 345516 | |
| Format: LSID | ||||
| ORGANISM_ | The full name of the lowest level taxon. | Terms list: WoRMS | Prochlorococcus marinus | |
| Format: Taxon name | ||||
| ORGANISM_ | The sex of a specimen or collected/observed individual(s). | Terms list: M = Male; F = Female; H = Hermaphrodite; I = Indeterminate (examined but could not be determined; U = Unkown (not examined); T = Transitional (between sexes; useful for sequential hermaphrodites); B = Both Male and Female | M | |
| ORGANISM_Life_Stage | Indicates the life stage present. | Free text | resting spores | |
| ORGANISM_Size | Refers to size measurements that are made concurrently to the enumeration and identification of organisms. | |||
| Define the measurement according to Table | ||||
| ORGANISM_Biovolume | Refers to volume measurements/calculations that are made concurrently to the enumeration and identification of organisms. | |||
| Define the measurement according to Table | ||||
| ORGANISM_Biomass | Refers to biomass measurements/calculations that are made concurrently to the enumeration and identification of organisms. | |||
| Define the measurement according to Table | ||||
Mandatory information is in bold and other fields are recommended WoRMS is the World Register of Marine Species (http://www.marinespecies.org/aphia.php?p=search).
M2B3 Reporting Standard about environmental measurement processes
| MEASUREMENT_ | Unique ID from a controlled vocabulary. | SDN:P02:75:xxxx | SDN:P02:75:CORG for Particulate organic carbon concentration in the water column | |
| MEASUREMENT _Name | Common name for the measurement. | Free text | POC | |
| MEASUREMENT _Quantity | Describes the quantity measured using terms from the Système International of units. | Free text; SI of units | Mass concentration | |
| MEASUREMENT _Dimensions | Describes the quantity measured using dimension terms from the Système International of units. | Free text; SI of units | M^1 L^-3 | |
| MEASUREMENT _Currency | May often refer to a TAXONOMY_ID or a CHEMICAL_ID. | Free text; | Organic carbon | |
| Terms list: WoRMS; | ||||
| Terms list: ChEBI | ||||
| MEASUREMENT _Units | Describes the units of the quantity measured using terms from the Système International of units. | SDN:P06:46:xxxx | SDN:P06:46:UGPL for μg/L | |
| MEASUREMENT _Method | Describes the measurement method used. Equivalent to methodological details provided in a paper. | Free text | Mass spectrometry | |
| MEASUREMENT _Comment | Any comment about the measurement. | Free text | Inorganic carbon removed by acidification |
Mandatory information is in bold and other fields are recommended.
Figure 1M2B3 Reporting Standard descriptors schematically depicted on the junction of three disciplines, adopting existing standards of each domain.
Figure 2Mandatory and recommended information of the M2B3 Reporting Standard; descriptors are split into six categories represented by coloured triangles, where mandatory descriptors are in the dark-shaded area and recommended information elements are in the light-shaded area. Environmental measurements in the ENVIRONMENT section are further specified in Figure 3.
Figure 3Mandatory (in the dark green area) and recommended (in the light green area) environmental measurements of the M2B3 Reporting Standard.
Figure 4The logical connection between environmental measurements (Table5), recording of the measurements (Table7) and measured values, shown on the example of three environmental parameters – salinity, nitrate and carbon organic particulate (POC).