| Literature DB >> 28174510 |
Dimitra Mavraki1, Lucia Fanini2, Marilena Tsompanou3, Vasilis Gerovasileiou1, Stamatina Nikolopoulou1, Eva Chatzinikolaou1, Wanda Plaitis1, Sarah Faulwetter4.
Abstract
BACKGROUND: This article describes the digitization of a series of historical datasets based οn the reports of the 1908-1910 Danish Oceanographical Expeditions to the Mediterranean and adjacent seas. All station and sampling metadata as well as biodiversity data regarding calcareous rhodophytes, pelagic polychaetes, and fish (families Engraulidae and Clupeidae) obtained during these expeditions were digitized within the activities of the LifeWatchGreece Research Ιnfrastructure project and presented in the present paper. The aim was to safeguard public data availability by using an open access infrastructure, and to prevent potential loss of valuable historical data on the Mediterranean marine biodiversity. NEW INFORMATION: The datasets digitized here cover 2,043 samples taken at 567 stations during a time period from 1904 to 1930 in the Mediterranean and adjacent seas. The samples resulted in 1,588 occurrence records of pelagic polychaetes, fish (Clupeiformes) and calcareous algae (Rhodophyta). In addition, basic environmental data (e.g. sea surface temperature, salinity) as well as meterological conditions are included for most sampling events. In addition to the description of the digitized datasets, a detailed description of the problems encountered during the digitization of this historical dataset and a discussion on the value of such data are provided.Entities:
Keywords: Clupeiformes ; Digitization ; Polychaeta ; Rhodophyta ; Danish Oceanographical Expedition; Data archaeology; Data management; Data rescue; Historical dataset; Marine biodiversity
Year: 2016 PMID: 28174510 PMCID: PMC5267529 DOI: 10.3897/BDJ.4.e11054
Source DB: PubMed Journal: Biodivers Data J ISSN: 1314-2828
Figure 1.The Danish Research Steamer "Thor". Image from Schmidt (1912).
List of all papers published in the series "Report on the Danish Oceanographical expeditions 1908-1910 to the Mediterranean and adjacent seas". For Volume 2, parts B, F and G, no reports were published.
|
|
|
|
|
|
| V1 | 1 | Introduction, hydrography, deposits of the sea-bottom | Johannes Schmidt | 1912 |
| V2 Biology | A1 | Flat-fishes (Heterosomata) | Harry Mcdonald Kyle | 1913 |
| V2 Biology | A2 | Poul Jespersen | 1915 | |
| V2 Biology | A3 | Shore-fishes | Louis Fage | 1918 |
| V2 Biology | A4 | Vilhelm Ege | 1918 | |
| V2 Biology | A5 | Johannes Schmidt | 1918 | |
| V2 Biology | A6 | Mediterranean | Johannes Schmidt and A. Strubberg | 1918 |
| V2 Biology | A7 | Mediterranean | Åge Vedel Tåning | 1918 |
| V2 Biology | A8 |
| Frédéric Guitel | 1920 |
| V2 Biology | A9 | Louis Fage | 1920 | |
| V2 Biology | A10 |
| Åge Vedel Tåning | 1923 |
| V2 Biology | A11 |
| Ernst Ehrenbaum | 1924 |
| V2 Biology | A12 | Mediterranean | Poul Jespersen and Åge Vedel Tåning | 1926 |
| V2 Biology | A13 | Vilhelm Ege | 1930 | |
| V2 Biology | A14 |
| W. Schnakenbeck | 1931 |
| V2 Biology | C1 |
| Eduard Degner | 1926 |
| V2 Biology | D1 | Knud Stephensen | 1915 | |
| V2 Biology | D2 | Knud Stephensen | 1918 | |
| V2 Biology | D3 | Knud Stephensen | 1923 | |
| V2 Biology | D4 | Knud Stephensen | 1924 | |
| V2 Biology | D5 | Knud Stephensen | 1926 | |
| V2 Biology | D6 |
| Johan T. Ruud | 1936 |
| V2 Biology | E1 | Pelagic polychaetes of the families, | Elise Wesenberg-Lund | 1939 |
| V2 Biology | H1 | Medusae | Paul Lassenius Kramp | 1924 |
| V2 Biology | H2 |
| H. B. Bigelow and M Sears | 1937 |
| V2 Biology | J1 | Mediterranean Ceratia | Eugen Jörgensen | 1920 |
| V2 Biology | J2 | Mediterranean | Eugen Jörgensen | 1923 |
| V2 Biology | J3 | Mediterranean | Eugen Jörgensen | 1924 |
| V2 Biology | J4 |
| J. Pavillard | 1926 |
| V2 Biology | K1 |
| Mme Paul Lemoine | 1915 |
| V2 Biology | K2 | Sea-grasses | C. H. Ostenfeld | 1918 |
| V2 Biology | K3 | Henning E. Petersen | 1918 | |
| V3 Miscellaneous papers | 1 | Experiments with drift-bottles : first report | Johannes Schmidt | 1913 |
| V3 Miscellaneous papers | 2 | The Sargasso Sea, its boundaries and vegetation | Øjvind Winge | 1923 |
| V3 Miscellaneous papers | 3 | On the quantity of macroplankton in the Mediterranean and the Atlantic | Poul Jespersen | 1923 |
| V3 Miscellaneous papers | 4 | Elvers from north and south Europe | A.C. Strubberg | 1923 |
| V3 Miscellaneous papers | 5 | Experiments with drift-bottles (second report) | Giovanni Platania | 1923 |
| V3 Miscellaneous papers | 6 | Nitrate and phosphate contents of the Mediterranean water | H. Thomsen | 1931 |
| V3 Miscellaneous papers | 7 | Some quantitative investigations on the bottom fauna at the west coast of Italy, in the Bay of Algiers, and at the coast of Portugal. | Ragnar Spärck | 1931 |
Figure 2.Timeline showing the different expeditions and the temporal coverage of the digitized publications.
Figure 3.Venn diagram showing the overlap of sampling stations across the four digitized datasets.
Summary of the content and coverage of the four digitized datasets. Note that depths may represent either sampling depths or bottom depths, this is unclear and confused in the publication (see also section "Step description" – "Difficulties and problems encountered during the digitization procedure")
|
|
|
|
| |
| Temporal coverage | 1905-06-13 to1912-01-07 | 1905-05-14 to1930-06-18 | 1904-09-05 to1914-01-24 | 1908-12-14 to1910-09-19 |
| Geographical coverage (min, max Latitude / min, max Longitude) | 30.33, 53.1 / | 0.516, 51.57 / | 28.6, 59.32 / | 30.38, 48.72 / |
| Minimum depth | 0 | 20 | 0 | 3 |
| Maximum depth | 6020 | >4000 | > 3700 | 98 |
| No. of stations | 443 | 210 | 208 | 16 |
| No. of samples | 1566 | 599 | 341 | 16 |
| No. of occurrence records | – | 883 | 646 | 59 |
| Vessels | Thor, Ingolf, Florida, Pangan, St. Croix, St. Jan, St. Thomas, Agent Petersen, Anne, Caroline Kock | Thor, Pangan, Dana | Thor, Algarvae, Nordboen, Pangan | Thor |
Figure 4.Map of the stations sampled by Thor and other vessels during the core expeditions of 1908–1909 and 1910 and additional expeditions from 1905–1906 and 1911–1912 (stations listed in the introductory table in Schmidt 1912).
Figure 5.Map of sampling stations where pelagic polychaetes were collected.
Figure 6.Map of sampling stations where fish () were collected.
Figure 7.Map of sampling stations where calcareous () were collected.
Sampling gears used during the "Thor" (and complementary) expeditions. Abbreviations are used in the table of samples in Schmidt's introductory volume (Schmidt 1912) and are also explained therein. Additional information on mesh sizes of nets was retrieved from Sverdrup et al. (1942).
|
|
|
| Aa 2 | Eel drift-seine |
| C 130 | Ring-trawl, 130 cm in diameter at opening, 1 mm mesh size, horizontal haul |
| C 200 | Ring-trawl, 200 cm in diameter at opening, 1 mm mesh size, horizontal haul |
| D 1 | Dredge, rectangular opening, 27x117 cm, 1 mm mesh size, horizontal haul |
| D 2 | Dredge, triangular opening, 45x45 cm, 1 mm mesh size, horizontal haul |
| H | Hand-Dredge, 18x14 cm |
| L | Long-line with halibut and cod hooks |
| M | Monaco trawl, 56x170 cm at opening, horizontal haul |
| N 30 | Nansen's closing net, 30 cm in diameter at opening, gauze No. 20, 0.076 mm mesh size, vertical haul |
| N 50 | Nansen's closing net, 50 cm in diameter at opening, gauze No. 20, 0.076 mm mesh size, vertical haul |
| O | Otter-trawl, head-rope 15.25 m (50 feet), 30 mm mesh size, horizontal haul |
| P 100 | Silk-net, open, conical, 100 cm in diameter at opening, gauze No. 3, 0.333 mm mesh size |
| P 30 | Silk-net, open, conical, 30 cm in diameter at opening, gauze No. 20, 0.076 mm mesh size |
| R | Shrimp net |
| S 100 | Stramin-net, open, conical, 100 cm in diameter at opening, 1 mm mesh size |
| S 150 | Stramin-net, open, conical, 150 cm in diameter at opening, 1 mm mesh size |
| S 200 | Stramin-net, open, conical, 200 cm in diameter at opening, 1 mm mesh size |
| T 25 | Taffeta-net, open, conical, 25 cm in diameter |
| Y 200 | Young-fish trawl, 200 cm in diameter at opening, 1 mm mesh size, horizontal haul |
| Y 330 | Young-fish trawl, 330 cm in diameter at opening, 1 mm mesh size, horizontal haul |
| E 1000 | Open Ringtrawl, 1000 cm in diameter |
| E 300 | Open Ringtrawl, 300 cm in diameter, meshes 24-18-12 mm |
| Y | Petersen trawl, without opening given |
Numbers of samples taken with each sampling gear during the "Thor" (and complementary) expeditions, per dataset. Samples where the gear used could not be determined (e.g. unknown abbreviation used, mixed gears reported per sample (samples merged), etc. – see also next paragraph) are excluded. Abbreviations are explained in Table 3.
|
|
|
|
|
|
| Aa 2 | 11 | |||
| C 130 | 3 | 1 | ||
| C 200 | 11 | 1 | 1 | |
| D 1 | 65 | 1 | 6 | |
| D 2 | 6 | 1 | ||
| E 1000 | 2 | |||
| E 300 | 2 | |||
| H | 10 | |||
| O | 1 | |||
| P 100 | 229 | 10 | 11 | |
| P 30 | 87 | |||
| R | 1 | |||
| S 100 | 76 | 6 | 9 | |
| S 150 | 19 | 15 | 3 | |
| S 200 | 27 | 345 | 13 | |
| T 25 | 4 | |||
| Y | 1 | 64 | ||
| Y 200 | 481 | 187 | 204 | |
| Y 330 | 219 | 20 | 26 |
Environmental parameters reported for the samples in the introductory table by Schmidt (1912).
|
|
|
|
| Weather | General meteorological conditions (cloudy, misty, clear etc.) are reported, but seem to be subjective, no scale of reference is reported anywhere. | |
| Wind Direction0–12 | Indicated in the column header as a 0–12 scale, but quadrants are reported (e.g. NNE; S) | 0–12 is an unusual scale (commonly, degrees are used) and not used in the actual data. No further information available, maybe simply a typographic error in the column name. |
| Wind Force 0–12 | Appears to be on a Beaufort scale, even if not specified in the original document. | |
| Sea Direction0–12 | Indicated to be on a 0–12 scale, but quadrants are reported (e.g. NNE; S). | Seems to be the observed direction of the swell, no further information given in the manuscript. |
| Sea Force0–12 | Appears to be on a Beaufort scale, even if not specified in the manuscript(s) (The Douglas scale would have only 10 possible values). | |
| Temperature Air | °C | Precision of one decimal place. Measurement device was not specified. |
| Temperature Surface | °C | Precision of two decimal places; given in the form of "12°40" in the manuscript. |
| Surface Chlorine | ‰ | Chlorine weight in grams per 1,000 g sea-water, unit specification from |
| Surface Salinity | ‰ | Obtained by titration in the Mediterranean Sea and by aerometer in Bosphorus and the Dardanelles. Salinity weight in grams per 1,000 g sea-water |
Figure 8.Excerpt from the list of samples of the "Thor" Expedition (taken from Schmidt 1912), showing the arrangement of information and the inconsistent use of ditto marks.
Figure 9.Number of species for , and groups according to the originally published (left) and the currently accepted (right) classification systems.
| Rank | Scientific Name | |
|---|---|---|
| class |
| |
| order |
| |
| phylum |
|
| Column label | Column description |
|---|---|
| id | Internal database ID. In event.txt identical to eventsID and constituting a unique identifier for the event (= sampling event). In measurementorfact.txt it is the ID of the event at which the measurement was taken. |
| eventID | Unique ID for the record, identical to id. |
| parentEventID | An identifier for the broader event that groups this and potentially other events – in this case, a link to the database internal ID of the station and included in the file only for compliance with the schema. |
| samplingProtocol | A free-text description of the method/ protocol used during the sampling event. |
| sampleSizeValue | A numeric value for a measurement of the size (time duration, length, area, or volume) of a sample in a sampling event. Does not contain values here, included only for compliance with the schema. |
| sampleSizeUnit | The unit of measurement of the size (time duration, length, area, or volume) of a sample in a sampling event. Does not contain values here, included only for compliance with the schema. |
| samplingEffort | Sampling time in minutes |
| eventDate | The date and time of sampling, recorded in standard format (ISO 8601:2004). |
| year | The year of sampling (four digits). |
| month | The month of sampling (one or two digits). |
| day | The day of the month of sampling (one or two digits). |
| habitat | A description of the habitat of the sample, in this case representing the information contained in the column "Nature of Bottom" of the original publication. |
| fieldNumber | A unique sampling code (across the digitized datasets). Here composed of the locationID and year and an incrementing small letter to distinguish between different sampling events at the same station in the same year. |
| eventRemarks | The name of research vessel, the bottom or sampling depth given and other remarks on the original text. Includes also any information from the original publications which was corrected during the digitization in cases it was clearly incorrect. |
| locationID | The station code as reported on the original text plus the sampling year, in some cases with slight modifications in order to avoid the same station code being used for different locations. |
| locality | The name of the locality of the sampling station, if given in the publication. |
| minimumDepthInMeters | The minimum (shallowest) sampling depth of the event. |
| maximumDepthInMeters | The maximum (deepest) sampling depth of the event. |
| locationRemarks | Comments or notes about the location and its coordinates. |
| decimalLatitude | The geographic latitude (in decimal degrees, using the spatial reference system given in geodeticDatum) of the geographic center of a Location. Positive values are north of the Equator, negative values are south of it. |
| decimalLongitude | The geographic longitude (in decimal degrees, using the spatial reference system given in geodeticDatum) of the geographic center of a Location. Positive values are east of the Greenwich Meridian, negative values are west of it. |
| coordinateUncertaintyInMeters | A radius of uncertainty around the given coordinates (in metres). The true location / coordinates may fall anywhere within that circle. |
| measurementID | A unique identifier for the MeasurementOrFact (information pertaining to measurements, facts, characteristics, or assertions). |
| measurementType | The type of the measurement (e.g. Chlorine at the water surface) |
| measurementValue | The value of the measurement. |
| measurementAccuracy | he description of the potential error associated with the measurementValue |
| measurementUnit | The unit associated with the measurementValue. |
| measurementDeterminedBy | Person(s) who determined the measurementValue. |
| measurementMethod | A description of or reference to (publication, URI) the method or protocol used to determine the measurement |
| measurementRemarks | Comments or notes about the measurement. |
| Column label | Column description |
|---|---|
| id | An identifier for the sampling event, linking to the column "id" in the event.txt file. |
| institutionCode | The name (or acronym) of the institution providing and curating the record. |
| datasetName | The name of the data set from which the record was derived |
| basisOfRecord | Required by the DarwinCore schema, describing the origin of the data record. Here, all are "Human Observation". |
| occurrenceID | A unique identifier (within the dataset) for the occurrence record. |
| catalogNumber | A constructed (globally) unique identifier, combination of dataset name and fieldNumber. |
| occurrenceRemarks | Any remarks, notes, comments on the occurrence. |
| individualCount | The number of individuals of the taxon in the sample. |
| sex | The biological sex of the taxon. |
| lifeStage | The life stage of the taxon. |
| preparations | Preparations of the sample (e.g. preservation in ethanol). Empty here, included for completeness of the schema. |
| identifiedBy | The person(s) who identified the taxon. |
| identificationReferences | Bibliographical references used for the identification. Empty here, included for completeness of the schema. |
| scientificNameID | A unique identifier for the name of the taxon, here the LSID of the World Register of Marine Species. |
| scientificName | The full scientific name retrieved from the World Register of Marine Species, after matching the taxon name as in the original publication. Contains the value of the field "ScientificName" as retrieved after using the function "Match taxa" of WoRMS. |
| kingdom | The name of the kingdom in which the taxon is classified. If the taxon was not found in WoRMS at the time of taxon matching, the field is empty. |
| phylum | The name of the phylum in which the taxon is classified. If the taxon was not found in WoRMS at the time of taxon matching, the field is empty. |
| class | The name of the class in which the taxon is classified. If the taxon was not found in WoRMS at the time of taxon matching, the field is empty. |
| order | The name of the order in which the taxon is classified. If the taxon was not found in WoRMS at the time of taxon matching, the field is empty. |
| family | The name of the family in which the taxon is classified. If the taxon was not found in WoRMS at the time of taxon matching, the field is empty. |
| genus | The name of the genus in which the taxon is classified. If the taxon was not found in WoRMS at the time of taxon matching, the field is empty. |
| subgenus | The name of the subgenus in which the taxon is classified (if present). |
| specificEpithet | The specific epithet of the taxon (if at species level). |
| nomenclaturalCode | The nomenclatural code under which the scientificName is constructed. Included here only for compliance with the schema. |
| taxonomicRemarks | Any remarks on the taxon, contains also the exact version of the taxon as written in the original publication. |
| Column label | Column description |
|---|---|
| all columns | All columns have already been described above in the datasets "Introduction. Report on the Danish Oceanographical Expeditions 1908-1910 to the Mediterranean and adjacent seas" and "Danish Oceanographical expeditions 1908-1910 to the Mediterranean and adjacent seas-Pelagic Polychaetes" and are not repeated here. |
| Column label | Column description |
|---|---|
| all columns | All columns have already been described above in the datasets "Introduction. Report on the Danish Oceanographical Expeditions 1908-1910 to the Mediterranean and adjacent seas" and "Danish Oceanographical expeditions 1908-1910 to the Mediterranean and adjacent seas-Pelagic Polychaetes" and are not repeated here. |