| Literature DB >> 32963248 |
Alessandro Filazzola1, Octavia Mahdiyan2, Arnab Shuvo2, Carolyn Ewins2, Luke Moslenko2, Tanzil Sadid2, Kevin Blagrave2, Mohammad Arshad Imrit2, Derek K Gray3, Roberto Quinlan2, Catherine M O'Reilly4, Sapna Sharma2.
Abstract
Measures of chlorophyll represent the algal biomass in freshwater lakes that is often used by managers as a proxy for water quality and lake productivity. However, chlorophyll concentrations in lakes are dependent on many interacting factors, including nutrient inputs, mixing regime, lake depth, climate, and anthropogenic activities within the watershed. Therefore, integrating a broad scale dataset of lake physical, chemical, and biological characteristics can help elucidate the response of freshwater ecosystems to global change. We synthesized a database of measured chlorophyll a (chla) values, associated water chemistry variables, and lake morphometric characteristics for 11,959 freshwater lakes distributed across 72 countries. Data were collected based on a systematic review examining 3322 published manuscripts that measured lake chla, and we supplemented these data with online repositories such as The Knowledge Network for Biocomplexity, Dryad, and Pangaea. This publicly available database can be used to improve our understanding of how chlorophyll levels respond to global environmental change and provide baseline comparisons for environmental managers responsible for maintaining water quality in lakes.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32963248 PMCID: PMC7508946 DOI: 10.1038/s41597-020-00648-2
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Fig. 1Workflow for all datasets included in the chlorophyll and water chemistry database.
Information about each of the data repositories that were obtained online including the number of lakes, number of observations, timeframe of surveys, and a relevant study that utilized the data.
| ID | Name | Lakes | Observations | Time frame | Relevant study | Notes |
|---|---|---|---|---|---|---|
| Repo1 | Ecology under lake ice | 39 | 1231 | 1969–2017 | Hampton | Paired winter and summer observations |
| Repo2 | Limnological data and depth profile from Oneida Lake | 1 | 222 | 1975–2018 | Karatayev | Measured weekly and averaged from five different locations |
| Repo3 | Transparency, geomorphology, and mixing regime explain variability in trends in lake temperature and stratification across northeastern North America | 215 | 219 | 1975–1985 | Richardson | Samples were measured in 1975, 1985, or both. |
| Repo4 | The European Multi Lake Survey (EMLS) dataset of physical, chemical, algal pigments and cyanotoxin parameters 2015 | 332 | 345 | 2015 | Mantzouki | Surveyed in the summer. Some sampling points were reclassified as within the same lake |
| Repo5 | Water quality database | 2,168 | 9,568 | 1967–2019 | An online database that joins data collected from multiple US government Agencies | |
| Repo6 | The Lake Inventory Program (formerly known as the Lake Survey Program) | 96 | 103 | 1974–2010 | Samples were taken during the summer months at varying sampling depths and averaged | |
| Repo7 | National Aquatic Resource Surveys | 1,059 | 1,162 | 2007 | Pollard | An integrated sampler was used to collect chla data at the centre of the lake |
| Repo8 | McMurdo Dry Valleys Chlorophyll-A Concentrations in Lakes | 7 | 102 | 1993–2016 | Burnett | Sampling is conducted below permanent ice-cover in summer months |
| Repo9 | Lake Kasumigaura Database | 1 | 476 | 1977–2016 | Takamura & Nakagawa (2012) Ecological Research[ | Twelve stations within the lake are sampled monthly |
| Repo10 | Cascade Project at North Temperate Lakes LTER High Frequency Sonde Data from Food Web Resilience Experiment 2008–2011 | 2 | 8 | 2008–2011 | Gries | Samples were collected at 5-minute intervals during the summer and averaged for the year |
| Repo11 | Lake Metabolism at North Temperate Lakes LTER 2000 | 24 | 24 | 2000 | Gries | Measurements were taken in July and August |
| Repo12 | Landscape Position Project at North Temperate Lakes LTER: Chlorophyll 1998–2000 | 49 | 52 | 1998–2000 | Gries | Samples were taken two times or monthly in the summer at three depths. |
| Repo13 | Unpublished data, Massachusetts Department of Environment Protection, lake water chemistry data, 1995–2004 | 111 | 111 | 1999–2004 | Five sampling events interspersed throughout the summer and averaged | |
| Repo14 | LAGOS-NE: a multi-scaled geospatial and temporal database of lake ecological context and water quality for thousands of US lakes | 8,218 | 209,732 | 1933–2013 | Soranno | A dataset compilation across government agencies and universities in the USA |
Fig. 2The distribution of lakes included in database that have measured chlorophyll values. Insets are provided for the USA and Europe to better separate the high density of observations from lakes in these areas.
Table attributes and descriptions from database of chlorophyll values in freshwater lakes (ChlData.csv).
| Attribute (column header) | Description of attribute | Data with values (%) |
|---|---|---|
| uniqueID | Unique identifier for each respective survey instance that exists across all datasets within this database | |
| UniqueLakeName | Unique lake identifier for reference across studies. | |
| StudyID | Study identifier to be connected to the data | |
| Year | Year that lake was surveyed. Can be discrete (e.g. 2005, 2006) or a range of years where the values were averaged (e.g. 2005–2007). | |
| Month | Month that lake was surveyed as a number | |
| Lat | Latitude of survey instance (decimal degrees) | |
| Lon | Longitude of survey instance (decimal degrees) | |
| LakeName | Name of lake as identified within the manuscript or data repository | |
| ChlaValues | Average concentration of chlorophyll a in freshwater lakes at each survey instance (mg L−1) | 100 |
| TP | Average concentration of total phosphorus in freshwater lakes at each survey instance (mg L−1) | 49.0 |
| TN | Average concentration of total nitrogen in freshwater lakes at each survey instance (mg L−1) | 17.3 |
| DOC | Average concentration of dissolved organic carbon in freshwater lakes at each survey instance (mg L−1) | 4.6 |
| DO | Average concentration of dissolved oxygen are in freshwater lakes at each survey instance (mg L−1) | < 1 |
| LakeVolume | The volume of the lake that was surveyed in m3 | < 1 |
| SurfaceArea | The measured surface area of the lake that was surveyed (km2) | 92.9 |
| Depth.mean | The average depth of the lake that was sampled in meters | 31.9 |
| Depth.max | The maximum depth of the lake that was sampled in meters | 82.5 |
| Secchi | The distance underwater that the Secchi depth was no longer visible from the surface (meters) | 85.8 |
| pH | The pH of sampled water | 5.7 |
| Chla.flag | An identifier to highlight Chla values that are below the detection limits listed in the study and thus subjected to inaccuracies. | 1.5 |
Table attributes and descriptions for meta-data files on studies (MS.citations.csv), data repositories (Repo.citations.csv), and methods of data collection (methodsData.csv).
| Data label | Description |
|---|---|
| StudyID | Identifier for the published study |
| Title | Title of published study |
| Authors | Authors of published study |
| Source Title | Journal that the study was published in |
| Publication Year | Year that the study was published |
| Volume | Volume from journal |
| Issue | Issue from journal |
| Beginning Page | First page in the journal that the study was published |
| Ending Page | Last page in the journal that the study was published |
| DOI | Digital Object Identifier associated with study |
| Total Citations | Total number of citations associated with the study as of October 2018 |
| Exclude | Whether the study was excluded from the database |
| reason.simplified | A simplified reason why the study was not used |
| StudyID | Study identifier to be connected to the data |
| StudyName | Name of study |
| Link | Link where data were obtained from |
| Author | Authors that were listed in study |
| Title | Title of study |
| DataSource | Source data were acquired from including databases, repositories, or online searches |
| Year | Year the dataset was published |
| Included | Whether the dataset was processed and added to the main dataset |
| StudyID | Study identifier to be connected to the data |
| Year | Year that the study was published |
| Chl method | The method of which the chlorophyll sample was measured |
| MeasurementType | The type of value as either the mean, median, or raw (unaggregated) |
| DetectionLimits | The lowest recorded measurement within the study |
| Survey.Type | The collect method, either |
| Depth.qual | A qualitative description of the depth that the measurement was taken such as surface, integrated or specific depth. |
| Depth.quant | A description of the depths that the measurement was collected |
| Column.rep | The number of depths that an integrated measurement was collected |
| Replicate | The total number of measurements that were included in generating the mean or number of observations. Includes replicates in column, area of lake, and time. |
| Spatial.rep | The number of locations within or among lakes that samples were collected |
| Spatial.qual | A description of the locations within a lake that a sample was collected (e.g. integrated, center, shoreline). |
| Temporal.rep | The number of measurements over time that were collected |
| Temporal.qual | A description of the time interval that was used for sampling. |
| StartDay | The day of the month the surveys began |
| StartMonth | The month of the year that the surveys began |
| StartYear | The year that the surveys began |
| EndDay | The day of the month the surveys ended |
| EndMonth | The month of the year that the surveys ended |
| EndYear | The year that the surveys ended |
| DepthDetails | A description of the sampling that was conducted on the column |
| DepthShallow | The shallowest depth that a sample was collected |
| DepthMean | The average depth samples were collected |
| DepthDeep | The maximum depth that a sample was collected. -999 represents the bottom of the lake. |
| NumObs | The total number of observations that are present in the study that are included in the database. |
Means and ranges of lake characteristics and water chemistry.
| Variable | Units | Mean | Range | Sample size (n) |
|---|---|---|---|---|
| Year | — | 2002 | 1933–2019 | 228,168 |
| TN | mg L−1 | 0.908 | 0–20.6 | 39,457 |
| TP | mg L−1 | 0.042 | 0–3.6 | 111,872 |
| DO | mg L−1 | 9.82 | 1.32–67.7 | 761 |
| DOC | mg L−1 | 0.008 | 0.01–1 | 10,517 |
| Max depth | meters | 15.6 | 0–310 | 188,205 |
| Mean depth | meters | 7.00 | 0.2–154 | 72,786 |
| pH | — | 7.99 | 5.5–10.7 | 12,934 |
| Secchi depth | meters | 2.76 | 0–61.7 | 195,782 |
| Surface area | squared kilometers | 25.11 | <0.001–32,056 | 211,975 |
| Chla | mg L−1 | 0.017 | 0–4.33 | 228,168 |
Fig. 3Frequency of observed chlorophyll values found in the lake dataset (n = 228,168).
Fig. 4Distribution of water chemistry and lake morphometry values from database. Values represent log-transformed equivalent of the units presented in Table 4, except pH which is already log-transformed.
| Measurement(s) | chlorophyll a • phosphorus atom • nitrogen atom • dissolved carbon atom in water • dissolved oxygen in water • volume • lake surface area • depth of water • pH |
| Technology Type(s) | digital curation |
| Factor Type(s) | geographic location |
| Sample Characteristic - Environment | freshwater lake |
| Sample Characteristic - Location | Earth (planet) |