| Literature DB >> 33776581 |
Laura Twardochleb1,2, Ethan Hiltner1, Matthew Pyne3, Phoebe Zarnetske2,4.
Abstract
MOTIVATION: Freshwater insects comprise 60% of freshwater animal diversity; they are widely used to assess water quality, and they provide prey for numerous freshwater and terrestrial taxa. Our knowledge of the distribution of freshwater insect diversity in the USA is incomplete because we lack comprehensive, standardized data on their distributions and functional traits at the scale of the contiguous United States (CONUS). We fill this knowledge gap by presenting Freshwater insects CONUS: A database of freshwater insect occurrences and traits for the contiguous United States. This database includes 2.05 million occurrence records for 932 genera in the major freshwater insect orders, at 51,044 stream locations sampled between 2001 and 2018 by federal and state biological monitoring programmes. Compared with existing open-access databases, we tripled the number of occurrence records and locations and added records for 118 genera. We also present life-history, dispersal, morphological and ecological traits and trait affinities (analogous to fuzzy-coded traits) for 1,007 stream insect genera, assembled from existing databases, reference books and the primary literature. We nearly doubled the number of traits for 11 trait groups and added traits for 180 genera that were not available from open-access databases. Our database, Freshwater insects CONUS, facilitates the mapping of freshwater insect taxonomic and functional diversity and, when paired with environmental data, will provide a powerful resource for quantifying how the environment shapes stream insect diversity and taxon-specific distributions. MAIN TYPES OF VARIABLES CONTAINED: Georeferenced occurrence records and traits for stream insects. SPATIAL LOCATION AND GRAIN: Contiguous United States at a grain of c. 1 m2. TIME PERIOD AND GRAIN: Occurrence records from January 2001 to December 2018, with 1-day temporal resolution. Traits from January 1911 to December 2018. MAJOR TAXA AND LEVEL OF MEASUREMENT: Genera from the orders Coleoptera, Diptera, Ephemeroptera, Hemiptera, Lepidoptera, Megaloptera, Neuroptera, Odonata, Plecoptera and Trichoptera. SOFTWARE FORMAT: .csv.Entities:
Keywords: contiguous United States; freshwater insects; functional traits; fuzzy‐coded traits; macroinvertebrates; occurrence records; streams; trait affinities
Year: 2021 PMID: 33776581 PMCID: PMC7986927 DOI: 10.1111/geb.13257
Source DB: PubMed Journal: Glob Ecol Biogeogr ISSN: 1466-822X Impact factor: 7.144
Functional traits of freshwater insects
| Grouping feature | Trait group | Trait | Definition | Definition citation |
|---|---|---|---|---|
| Life history | Number of generations per year | Semivoltine | Less than one generation per year | Poff et al. ( |
| Univoltine | One generation per year | Poff et al. ( | ||
| Bi_multivoltine | More than one generation per year | Poff et al. ( | ||
| Synchronization of emergence | Well | Emergence occurs within a matter of days | Poff et al. ( | |
| Poorly | Emergence occurs within a matter of weeks or months | Poff et al. ( | ||
| Emergence season | Spring | Emergence between the months of March and May | ||
| Summer | Emergence between the months of June and August | |||
| Fall | Emergence between the months of September and November | |||
| Winter | Emergence between the months of December and February | |||
| Dispersal | Female dispersal | Low | <1 km flight before laying eggs | Poff et al. ( |
| High | >1 km flight before laying eggs | Poff et al. ( | ||
| Adult flying strength | Weak | Taking frequent breaks while flying, or flight is low to the ground | Poff et al. ( | |
| Strong | Able to fly into a light breeze or fly for several miles without breaks | Poff et al. ( | ||
| Morphology | Maximum body size | Small | <9 mm | Poff et al. ( |
| Medium | 9–16 mm | Poff et al. ( | ||
| Large | >16 mm | Poff et al. ( | ||
| Respiration mode | Tegument | An outer covering, outer enveloping cell layer or membrane used to acquire oxygen | Merritt et al. ( | |
| Gills | A thin‐walled structure with trachea, used for the absorption of oxygen | Arnett ( | ||
| Plastron, spiracle | Oxygen is absorbed from the atmosphere, from aquatic plants or from a temporary air store, such as an air film or bubble on the surface of the body, or a permanent air store (a plastron) | Merritt et al. ( | ||
| Ecology | Rheophily | Depo | Occupies running‐water pools or margins with fine sediments (sand and silt) | Merritt et al. ( |
| Depo_eros | Occupies both erosional and depositional habitats | Merritt et al. ( | ||
| Eros | Occupies running‐water riffles with coarse sediments (cobbles, pebble, gravel) | Merritt et al. ( | ||
| Thermal preference | Cold stenothermal | <5 °C | Vieira et al. ( | |
| Cold‐cool eurythermal | 0–15 °C | Vieira et al. ( | ||
| Cool‐warm eurythermal | 5–30 °C | Vieira et al. ( | ||
| Warm eurythermal | 15–30 °C | Vieira et al. ( | ||
| Hot eurythermal | >30 °C | Vieira et al. ( | ||
| Habit | Crawler | Adapted for crawling on the surface of floating leaves of vascular hydrophytes or fine sediments on the bottom of water bodies | Merritt et al. ( | |
| Burrower | Inhabiting the fine sediment of streams and lakes | Merritt et al. ( | ||
| Clinger | Representatives have behavioural and morphological adaptations for attachment to surfaces in stream riffles and wave‐swept rocky littoral zones of lakes | Merritt et al. ( | ||
| Skater | Adapted for skating on the surface, where they feed as scavengers on organisms trapped in the surface film | Merritt et al. ( | ||
| Swimmer | Adapted for fish‐like swimming in lotic or lentic habitats | Merritt et al. ( | ||
| Sprawler | Inhabiting the surface of floating leaves of vascular hydrophytes or fine sediments | Merritt et al. ( | ||
| Climber | Adapted for living on vascular hydrophytes or detrital debris, with modifications for moving vertically on stem‐type surfaces | Merritt et al. ( | ||
| Planktonic | Inhabiting the open water limnetic zone of standing waters | Merritt et al. ( | ||
| Feeding style | Predator | Insects that ingest prey whole or in parts (engulfers) or that pierce prey tissues and suck fluids (piercers) | Merritt et al. ( | |
| Collector‐gatherer | Insects that collect and consume decomposing organic matter | Cummins ( | ||
| Collector‐filterer | Insects that collect and filter living algal cells or detritus | Merritt et al. ( | ||
| Herbivore | Insects that scrape algae or that shred or pierce living aquatic plants | Merritt et al. ( | ||
| Shredder | Insects that shred decomposing vascular plant tissue (detritivores) | Poff et al. ( | ||
| Parasite | Parasites that consume living animal tissue | Merritt et al. ( |
To be consistent with the unified trait terminology for stream organisms proposed by Schmera et al., (2015), we have reorganized traits by grouping feature and trait groups. A definition for each trait and literature citation for that definition are provided.
FIGURE 1Database assembly steps. Steps for traits are shown in green boxes and occurrence records in blue boxes. We assembled our trait dataset from the U.S. Environmental Protection Agency (USEPA) Biological Traits Database, taxonomic guides and entomology texts, scientific articles, and with the help of taxonomic experts. The occurrence dataset was assembled from data from the Water Quality Portal and requests to state environmental agencies. We recorded trait data following definitions in Table 1 and recorded state sampling methodology based on field sampling manuals from state agencies. We digitized data in Microsoft Excel. We then performed data cleaning and taxonomic harmonization in R, using the package “taxize”. Finally, we assigned modal traits, as the most commonly occurring trait in a trait group for each genus, and a trait affinity, or the percentage affinity of a genus toward each trait in a trait group. Icons are from IAN Symbol Libraries (https://ian.umces.edu/symbols/)
FIGURE 2Database layout, with connecting lines indicating relationships among tables. Orange boxes are the “raw” community and trait datasets cleaned from data in the Data_Sources table (purple) using R scripts. “Cleaned” trait tables are shown in green and occurrence records in blue. Tables of ancillary information are in grey. From left to right: Raw_Traits contains data for each genus varying by location, species and literature source, which we digitized and cleaned during steps 2, 3 and 4 of database assembly (Figure 1). Genus_Traits and Genus_Trait_Affinities contain modal traits and trait affinities that we produced from Raw_Traits using R scripts during step 5. Ancillary_Trait contains information about each trait (Table 1). Genus_Occurrences contains occurrence records that we produced from Raw_Community_Data using R scripts in database assembly steps 3 and 4. Ancillary_Taxonomy contains taxonomic names recorded in the Water Quality Portal (WQP), state data and U.S. Environmental Protection Agency (USEPA) database, with their corresponding accepted names, taxonomic serial numbers and higher taxonomic designations obtained during step 4. Raw_Community_Data contains occurrence data from the WQP and state agencies supplied in data tables listed in Data_Sources. We recorded additional data about state sampling methodology in Ancillary_Sample_Method during step 2. We cleaned the data files in Data_Sources using R scripts during steps 3 and 4
Contents and relationships among data tables (Figure 2)
| Data table name | Content | Links to other tables | Database assembly steps |
|---|---|---|---|
| Raw_Traits | Cleaned trait data using R scripts for each taxonomic name (“Submitted name_trait”, usually genus, occasionally species or family) recorded in datasets from the WQP, state agencies or USEPA. There are multiple trait entries separated by row for each taxon, with each row presenting trait data recorded from a different location, species or literature source | Ancillary_Taxonomy through “Submitted_name” column | 1, 2, 3, 4 |
| Genus_Traits | Modal traits for each genus assigned from data in Raw_Traits using R scripts | Genus_Trait_Affinities and Ancillary_Trait through “Trait” column. Genus_Occurrences and Ancillary_Taxonomy through “Genus” column | 5 |
| Genus_Trait_Affinities | Trait affinities for each genus assigned from data in Raw_Traits using R scripts | Linkages are the same as for Genus_Traits, above | 5 |
| Ancillary_Trait | Information about traits contained in Table | Genus_Traits and Genus_Trait_Affinities through “Trait” column | 1 |
| Genus_Occurrences | Genus occurrence records produced from Raw_Community_Data using R scripts | Genus_Traits through “Trait” column and Ancillary_Taxonomy through “Genus” column | 3, 4 |
| Ancillary_Taxonomy | Data from taxonomic harmonization, including taxonomic names (“Submitted_name”) recorded in the WQP, state data and USEPA database and the corresponding accepted names, taxonomic serial numbers and higher taxonomic designations. Users can search on any column in Ancillary_Taxonomy and find corresponding occurrence and trait records in other tables | Raw_Traits and Raw_Community_Data through “Submitted_name” column. Genus_Traits, Genus_Trait_Affinities and Genus_Occurrences through “Genus” column | 4 |
| Data_Sources | Information about source data files, state agency websites and agency contacts | 1 | |
| Raw_Community_Data | Cleaned occurrence data from the WQP and state agencies using R scripts. Includes records for taxa identified to species, genus, family or order | Genus_Occurrences through “Unique_ID”. Ancillary_Taxonomy through “Submitted_name”. Ancillary_Sample_Method through “Sample_method” | 2, 3, 4 |
| Ancillary_Sample_Method | Detailed methodology for sample methods in Raw_Community_Data | Raw_Community_Data through “Sample_method” and Data_Sources through “Data_source” | 2 |
“Links to other tables” indicates which columns can be used to join related tables. The database assembly steps (Figure 1) involved in creating each table are also provided.
Abbreviations: USEPA, U.S. Environmental Protection Agency; WQP, Water Quality Portal.
FIGURE 3(a,b) Genus richness by occurrence location for all orders (a) and for each order individually (b). Dark points indicate low genus richness and red indicate high richness. Note that genus richness has not been corrected for sampling bias
FIGURE 4Proportion of genera at each occurrence location assigned a modal trait of bivoltine–multivoltine (number of generations per year), erosional (rheophily), gills (respiration mode) and warm eurythermal (thermal preference). Dark points are sites where a low proportion of genera have the trait, and yellow points indicate that a high proportion have the trait
Number of genus occurrence and trait records by insect order
| Order | Number of genus occurrence records | Number of genus occurrence locations | Number of genera with occurrence records | Number of genera with trait records | Number of species with records in each order |
|---|---|---|---|---|---|
| Coleoptera | 210,077 | 44,669 | 145 | 160 | 464 |
| Diptera | 862,826 | 49,572 | 335 | 363 | 556 |
| Ephemeroptera | 381,077 | 46,737 | 93 | 100 | 426 |
| Hemiptera | 22,528 | 10,231 | 48 | 56 | 140 |
| Lepidoptera | 3,756 | 2,956 | 18 | 4 | 9 |
| Megaloptera | 25,056 | 15,302 | 9 | 8 | 13 |
| Neuroptera | 389 | 362 | 3 | 2 | 4 |
| Odonata | 73,760 | 23,102 | 66 | 73 | 427 |
| Plecoptera | 137,377 | 29,155 | 97 | 99 | 401 |
| Trichoptera | 338,460 | 46,682 | 127 | 145 | 673 |
FIGURE 5(a) Traits: Number of genera assigned a modal trait for each trait group after data cleaning and taxonomic harmonization with data originating from the U.S. Environmental Protection Agency (USEPA) traits database (black bars; USEPA) versus our database (green bars; CONUS). (b) Occurrence records: Locations after data cleaning originating from the WQP (black points) versus our database (blue points; CONUS)