Kristin K Isaacs, Jonathan T Wall, Ashley R Williams, Kevin A Hobbie, Jon R Sobus, Elin Ulrich, David Lyons, Kathie L Dionisio, Antony J Williams, Christopher Grulke, Caroline A Foster, Josiah McCoy, Charles Bevington.
Abstract
Direct monitoring of chemical concentrations in different environmental and biological media is critical to understanding the mechanisms by which human and ecological receptors are exposed to exogenous chemicals. Monitoring data provide evidence of chemical occurrence in different media and can be used to inform exposure assessments. They also provide the information required to parameterize and evaluate predictive models based on chemical uses, fate and transport, and release or emission processes. Finally, these data are useful in supporting regulatory chemical assessment and decision-making. A wide variety of public monitoring data is available from existing government programs, historical efforts, public data repositories, and peer-reviewed literature databases. However, these data are difficult to access and analyze in a coordinated manner. Here, data from 20 individual public monitoring data sources were extracted, curated for chemical and medium, and harmonized into a sustainable machine-readable data format for support of exposure assessments.
Year: 2022 PMID: 35710792 PMCID: PMC9203490 DOI: 10.1038/s41597-022-01365-8
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 8.501
Sources included in this Multimedia Chemical Monitoring Database.
| Source | Abbreviation | Source Description | Website |
|---|---|---|---|
| American Healthy Homes Survey | ahhs | Nationally representative study of contaminants in homes by the U.S. Department of Housing and Urban Development | |
| National Atmospheric Deposition Program, Atmospheric Integrated Research Monitoring Network (AIRMoN) | airmon | AIRMoN is a monitoring network of seven sites in the Eastern U.S. - data were available for 1992–2015 (no | |
| Biomonitoring California | biomon_ca | Collaborative biomonitoring effort (The California Environmental Contaminant Biomonitoring Program, also known as Biomonitoring California), implemented by the California Department of Public Health and the California Environmental Protection Agency | |
| California Air Monitoring Network | ca_airmon | Multi-year air monitoring network to measure pesticides in various agricultural communities in California (2012–2016) | |
| California Surface Water Database | ca_surf | Surface Water Database (SURF) maintained by the California Department of Pesticide Regulation (DPR), containing data from a wide variety of environmental monitoring studies | |
| California Air Resources Board (CARB) | carb | Report from CARB to the California Legislature on indoor air pollution (2005) | |
| ChemTHEATRE | chem_theatre | ChemTHEATRE: Chemicals in the THEATRE [Tractable and Heuristic E-Archive for Traceability and Responsible-care Engagement], a platform for archival of environmental measurements supported by the Long-range Research Initiative (LRI) and the Japan Chemical Industry Association (JCIA) | |
| Comparative Toxicogenomics Database | ctd | A robust, publicly available database of data from published sources that aims to advance understanding about how environmental exposures affect human health | |
| EPA Nine POTW Study | epa_9potw | Results from an EPA study of the occurrence of contaminants of emerging concern in wastewater from publicly owned treatment works (POTW) | |
| U.S. Environmental Protection Agency (EPA) Ambient Monitoring Technology Information Center – Air Toxics Data | epa_amtic | Ambient Monitoring Archive of the EPA’s Ambient Monitoring Technology Information Center (AMTIC). The archive covers measurements of hazardous air pollutants (HAPs) from as early as 1990 to 2016. The archive for HAPs currently houses data from over 2,500 monitoring sites. | |
| EPA Discharge Monitoring Report Data | epa_dmr | State-level data for 2007–2016 from discharge monitoring reports from EPA’s Enforcement and Compliance History Online site. | |
| EPA Office of Water, National Study of Chemical Residues in Lake Fish Tissue | epa_nscrlft | Data from a published report on a national EPA study to estimate the national distribution of selected persistent, bioaccumulative, and toxic (PBT) chemical residues in fish tissue from lakes and reservoirs of the United States. | |
| Targeted National Sewage Sludge Survey | epa_tnsss | 2009 EPA survey to examine over 350 pollutants in sewage sludge. | |
| EPA Unregulated Contaminant Monitoring Rule | epa_ucmr | Data collected under the EPA Unregulated Contaminant Monitoring Rule (UCMR3). The rule is used to collect data for contaminants that are suspected to be present in drinking water and do not have health-based standards set under the Safe Drinking Water Act (SDWA). State-level data from 2013–2015. | |
| U.S. Food and Drug Administration (FDA) Total Diet Study | fda_tds | Ongoing FDA program that monitors levels of about 800 contaminants and nutrients in the average U.S. diet. Database includes data from 2003–2011. | |
| ICES-DOME | ices | Marine Environment Data Portal of The International Council for the Exploration of the Sea (ICES), an intergovernmental marine science organization. | |
| Information Platform for Chemical Monitoring Data (IPCHEM) | ip_chem | IPCHEM is a single web access point for locating and accessing chemical monitoring data across all media in the European Union. Data include both environmental and biomonitoring data. | |
| National Health and Nutrition Examination Survey | nhanes | National Health and Nutrition Examination Survey. Fourth National Report on Human Exposure to Environmental Chemicals, Updated Tables, March 2018, Volume One. | |
| U.S. Department of Agriculture (USDA) National Residue Program (NRP) | usda_nrp | Chemical residue results for meat, poultry, and egg products. | |
| United States Geological Survey (USGS) Monitoring Data – National Water Quality Monitoring Council | usgs | Monitoring data from USGS for air, biological tissue, groundwater, sediment, soil, surface water, and tissue (2010–2018). | |
The sources may be further refined (e.g., by media type or other data subset) in later tables. Details of these sources (including extraction method and additional links) are provided in Supplementary Table S1.
Fig. 1 MMDB Entity Relationship Diagram. See Supplementary Table S2 for a full description of database variables.
Fig. 2 Workflow for creating the Multimedia Monitoring Database. Details of each workflow phase are included in the Methods and Technical Validation sections.
Harmonized media identifiers in the multimedia monitoring database.
| Harmonized Medium | Description |
|---|---|
| ambient air | Outdoor ambient air |
| drinking water | Treated or untreated drinking water supplies, tap water, bottled drinking water, cooking water |
| groundwater | Water from groundwater sources (wells, aquifers) |
| product | Non-food consumer products |
| sediment | Freshwater or marine sediments |
| sludge | Sewage sludge |
| soil | Soil, sand, or outdoor settled dust |
| surface water | Lake, river, or marine surface water; includes rainwater |
| indoor air | Residential or other indoor air samples |
| indoor dust | Residential or other indoor dust samples (from any location) |
| landfill leachate | Landfill leachate (water having passed through landfill solids) |
| other-environmental | Other environmental media, not classified elsewhere |
| personal air | Personal air sample or exhaled breath |
| precipitation | Snow, rainfall, or other atmospheric deposition |
| wastewater (influent, effluent) | Inflow or outflow samples from municipal or industrial sites |
| breast milk | Human breast milk |
| human (other tissues or fluids) | Human tissues or fluids other than blood or urine, including nails, hair, semen, adipose tissue, saliva, sputum, sweat, amniotic fluid, bone, and others |
| human blood (whole/serum/plasma) | Human whole blood, blood cells, serum, plasma, or other extractants, including fetal or umbilical samples |
| urine | Human urine |
| skin wipes | Wipes from human skin (any body surface) |
| wildlife (aquatic invertebrate) | Marine or freshwater invertebrates (e.g., crustaceans, mollusks), any tissue |
| wildlife (aquatic vertebrates/mammals) | Non-fish aquatic vertebrates or mammals, any tissue |
| wildlife (birds) | Avian species, any tissue (including eggs) |
| wildlife (fish) | Fish species, any tissue |
| wildlife (terrestrial invertebrates/worms) | Terrestrial invertebrates, any tissue |
| wildlife (terrestrial vertebrates) | Terrestrial vertebrates, any tissue |
| other-ecological | Other ecological species not categorized elsewhere, including algae and seaweeds |
| vegetation | Terrestrial vegetation including non-processed fruits and vegetables |
| livestock/meat | Unprocessed meat products or samples from non-fish animals to be used as food |
| raw agricultural commodity | Unprocessed raw fruits, vegetables, grains, nuts, or seeds that have been grown for food |
| food product | Processed food products, including dairy products, breads, cooked meats, processed (e.g., canned or frozen) fruit and vegetable products, infant formula |
Records were assigned to “unknown” if the medium could not be determined from the reported information.
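The assignment of reported media to the harmonized identifiers above, with "unknown" as the fallback, could be sketched roughly as follows. This is only an illustration: the keyword rules, the `HARMONIZED_MEDIA_RULES` name, and the `harmonize_medium` function are hypothetical and are not the curation rules actually used to build the database; only the harmonized medium labels come from the table.

```python
# Hypothetical keyword rules for illustration only; the actual MMDB curation
# logic is not described here. Only the harmonized labels come from the table.
HARMONIZED_MEDIA_RULES = [
    ("drinking water", ("drinking water", "tap water", "bottled water")),
    ("groundwater", ("groundwater", "well water", "aquifer")),
    ("surface water", ("lake", "river", "seawater")),
    ("sludge", ("sludge",)),
    ("urine", ("urine",)),
]


def harmonize_medium(reported: str) -> str:
    """Return a harmonized medium for a reported description.

    Rules are checked in order; the first rule with a matching keyword wins.
    Anything that matches no rule is assigned to "unknown".
    """
    text = reported.strip().lower()
    for harmonized, keywords in HARMONIZED_MEDIA_RULES:
        if any(keyword in text for keyword in keywords):
            return harmonized
    return "unknown"


examples = ["Bottled water", "Well water, rural", "Sewage sludge", "not reported"]
harmonized = [harmonize_medium(e) for e in examples]
```

Simple substring matching of this kind is easy to audit but can misclassify ambiguous descriptions, so a real curation workflow would likely need manual review of unmatched or conflicting records.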
Summary of chemicals, media, and data by source.
| Data Type | Source | Unique Curated DTXSIDs | Unique Chemical Identifiers | Media Represented | Number of Observations |
|---|---|---|---|---|---|
| Summary | ahhs | 29 | 29 | indoor dust; soil | 57 |
| Single-sample | airmon | 9 | 9 | ambient air | 342540 |
| Summary | biomon_ca | 91 | 92 | human blood (whole/serum/plasma) | 2616 |
| Summary | ca_airmon | 41 | 44 | ambient air | 452 |
| Single-sample | ca_surf_sediment | 120 | 123 | sediment | 72205 |
| Single-sample | ca_surf_water | 362 | 380 | surface water | 497463 |
| Summary | carb | 10 | 11 | ambient air; indoor dust; personal air | 368 |
| Single-sample | chem_theatre | 424 | 498 | wildlife (fish); wildlife (aquatic vertebrates/mammals); wildlife (terrestrial vertebrates); wildlife (birds); sediment; wildlife (terrestrial invertebrates/worms); surface water; soil; wastewater (influent, effluent); vegetation; unknown; ambient air; groundwater | 49058 |
| Summary | ctd | 801 | 906 | unknown; product; human (other tissues or fluids); ambient air; personal air; indoor air; wildlife (fish); skin wipes; food product; raw agricultural commodity; human blood (whole/serum/plasma); wildlife (aquatic invertebrate); indoor dust; soil; livestock/meat; breast milk; vegetation; sediment; surface water; urine; wastewater (influent, effluent); drinking water; groundwater | 100826 |
| Single-sample | epa_9potw | 172 | 176 | wastewater (influent, effluent) | 3150 |
| Single-sample | epa_amtic | 91 | 217 | ambient air | 2871688 |
| Summary | epa_dmr | 825 | 1332 | wastewater (influent, effluent) | 4111611 |
| Summary | epa_nscrlft | 196 | 231 | wildlife (fish) | 3696 |
| Single-sample | epa_tnsss | 143 | 145 | sludge | 12181 |
| Single-sample | epa_ucmr | 33 | 35 | drinking water | 1036486 |
| Single-sample | fda_tds_elem | 19 | 35 | raw agricultural commodity; food product | 142365 |
| Single-sample | fda_tds_pest | 150 | 252 | raw agricultural commodity; food product | 20100 |
| Single-sample | ices_biota | 330 | 447 | wildlife (fish); wildlife (aquatic vertebrates/mammals); wildlife (birds); wildlife (aquatic invertebrate); vegetation; wildlife (terrestrial vertebrates) | 1262673 |
| Single-sample | ices_sediment | 303 | 391 | sediment | 533236 |
| Single-sample | ip_chem_biomonitoring | 137 | 176 | human (other tissues or fluids); urine; wildlife (fish); vegetation; wildlife (terrestrial vertebrates); wildlife (aquatic invertebrate); wildlife (terrestrial invertebrates/worms); human blood (whole/serum/plasma); sediment; wildlife (birds) | 182761 |
| Single-sample | ip_chem_biota | 74 | 99 | wildlife (aquatic invertebrate); wildlife (fish); other-ecological; vegetation; wildlife (birds); wildlife (aquatic vertebrates/mammals); wildlife (terrestrial vertebrates) | 826827 |
| Summary | ip_chem_ibs | 14 | 17 | urine | 4216 |
| Summary | ip_chem_lakes | 689 | 832 | surface water | 4761124 |
| Single-sample | ip_chem_seawater | 85 | 155 | surface water | 350160 |
| Single-sample | ip_chem_sediment | 82 | 112 | sediment | 338066 |
| Summary | nhanes | 244 | 444 | human blood (whole/serum/plasma); urine | 84665 |
| Single-sample | usda_nrp | 45 | 49 | livestock/meat; food product | 5051 |
| Single-sample | usgs | 2154 | 2852 | surface water; wildlife (fish); sediment; ambient air; groundwater; wastewater (influent, effluent); wildlife (aquatic invertebrate); wildlife (terrestrial invertebrates/worms); precipitation; wildlife (terrestrial vertebrates); soil; unknown; other-environmental; other-ecological; drinking water; vegetation; landfill leachate; wildlife (aquatic vertebrates/mammals); livestock/meat; wildlife (birds) | 46152942 |
Source subsets defined in Supplementary Table S1. For summary sources, the observations include different summary statistics for each chemical.
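As a small worked example of reading the summary table, the sketch below totals the number of observations by data type for four rows copied from the table (ahhs, airmon, biomon_ca, epa_tnsss). The tuple layout and the `observations_by_type` helper are assumptions made for illustration and do not reflect the database's actual schema.

```python
from collections import defaultdict

# (data type, source, number of observations), copied from four table rows.
rows = [
    ("Summary", "ahhs", 57),
    ("Single-sample", "airmon", 342540),
    ("Summary", "biomon_ca", 2616),
    ("Single-sample", "epa_tnsss", 12181),
]


def observations_by_type(records):
    """Sum the observation counts separately for each data type."""
    totals = defaultdict(int)
    for data_type, _source, n_observations in records:
        totals[data_type] += n_observations
    return dict(totals)


totals = observations_by_type(rows)
# totals == {"Summary": 2673, "Single-sample": 354721}
```

Because observations from summary sources count individual summary statistics rather than individual samples, the two totals are best kept separate and not added together.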
Fig. 3 Location of origin of single-sample data in the Multimedia Monitoring Database. Single-sample data are from 42 countries, with most samples from the United States or Europe. Color denotes count of individual samples (Nsamples) in each country or U.S. state.
| Measurement(s) | occurrence of chemicals in environmental and biological media |
| Technology Type(s) | mass spectrometry (curation of public data) |
| Sample Characteristic - Organism | Homo sapiens • aquatic invertebrates • aquatic vertebrates/mammals • birds • fish • terrestrial invertebrates/worms • terrestrial vertebrates • livestock |
| Sample Characteristic - Environment | surface water • drinking water • groundwater • ambient air • indoor air • indoor dust • landfill leachate • personal air • precipitation • wastewater • soil • sediment • sludge • consumer products • food products • raw agricultural commodities |