| Literature DB >> 36057654 |
Marina G Potapova1, Sylvia S Lee2, Sarah A Spaulding3, Nicholas O Schulte3.
Abstract
Sediment diatoms are widely used to track environmental histories of lakes and their watersheds, but merging datasets generated by different researchers for further large-scale studies is challenging because of taxonomic discrepancies caused by rapidly evolving diatom nomenclature and taxonomic concepts. We collated five datasets of lake sediment diatoms from the Northeastern USA using a harmonization process which included updating synonyms, tracking the identity of inconsistently identified taxa, and grouping those that could not be resolved taxonomically. Each harmonization step led to an increase in variation explained by environmental variables and a parallel reduction of variation attributable to taxonomic inconsistency. To maximize future use of the data and underlying specimens we provide the original and harmonized counts for 1327 core samples from 607 lakes, name translation schemes, sample metadata, specimen museum locations, and the Northeast Lakes Voucher Flora, which is a set of light microscope images grouped into 1154 morphological operational taxonomic units. Post-hoc harmonization enables data quality control when other approaches (e.g., upfront management of taxonomic consistency) are not possible.Entities:
Year: 2022 PMID: 36057654 PMCID: PMC9440916 DOI: 10.1038/s41597-022-01661-3
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 8.501
Data sources.
| Dataset code | Geographic coverage | Diatom Counts | Sampling site characteristics | Sampling period | Number of samples |
|---|---|---|---|---|---|
| EMAP | Northeastern USA | [ | [ | 1991–1996 | 603 |
| NLA07 | Northeastern USA | [ | [ | 2007 | 238 |
| NLA17 | Northeastern USA | [ | [ | 2017 | 106 |
| VT | Vermont state, USA | Vermont Department of Environmental Conservation | [ | 2012–2018 | 178 |
| NJ | New Jersey state, USA | Academy of Natural Sciences of Drexel University | New Jersey Department of Environmental Protection | 2005–2014 | 202 |
Fig. 1Taxonomic harmonization workflow.
Fig. 2Assessment of the degree of taxonomic inconsistency among the 5 subsets of diatom count data by Redundancy Analysis. The percent of total variation in the count data explained by “dataset” is shown in each panel and visualized by confidence ellipses in the NMDS ordination space.
Fig. 3Proportion of variation in diatom count data explained by measured environmental characteristics (blue) and taxonomic differences among five count subsets (yellow) as measured by partial RDA analyses. Green color shows the overlap in variation explained simultaneously by both environmental and taxonomic differences: this part of the variation is attributable to environmental differences among datasets.
| Measurement(s) | diatom taxa • abundance |
| Technology Type(s) | observation • literature review • digital curation |
| Sample Characteristic - Organism | Bacillariophyta |
| Sample Characteristic - Environment | Lakes |
| Sample Characteristic - Location | Northeastern Unites States of America |