| Literature DB >> 36212022 |
Meghan A Balk1,2, John Deck3,4, Kitty F Emery5, Ramona L Walls6,7, Dana Reuter8, Raphael LaFrance5, Joaquín Arroyo-Cabrales9, Paul Barrett8, Jessica Blois10, Arianne Boileau11, Laura Brenskelle12, Nicole R Cannarozzi5, J Alberto Cruz9, Liliana M Dávalos13, Noé U de la Sancha14,15, Prasiddhi Gyawali16, Maggie M Hantak5, Samantha Hopkins8,17, Brooks Kohli18, Jessica N King5, Michelle S Koo19, A Michelle Lawing20, Helena Machado8, Samantha M McCrane21, Bryan McLean22, Michèle E Morgan23, Suzanne Pilaar Birch24,25, Denne Reed26, Elizabeth J Reitz27, Neeka Sewnath12, Nathan S Upham28, Amelia Villaseñor29, Laurel Yohe30, Edward B Davis8,18, Robert P Guralnick5.
Abstract
Understanding variation of traits within and among species through time and across space is central to many questions in biology. Many resources assemble species-level trait data, but the data and metadata underlying those trait measurements are often not reported. Here, we introduce FuTRES (Functional Trait Resource for Environmental Studies; pronounced few-tress), an online datastore and community resource for individual-level trait reporting that utilizes a semantic framework. FuTRES already stores millions of trait measurements for paleobiological, zooarchaeological, and modern specimens, with a current focus on mammals. We compare dynamically derived extant mammal species' body size measurements in FuTRES with summary values from other compilations, highlighting potential issues with simply reporting a single mean estimate. We then show that individual-level data improve estimates of body mass-including uncertainty-for zooarchaeological specimens. FuTRES facilitates trait data integration and discoverability, accelerating new research agendas, especially scaling from intra- to interspecific trait variability.Entities:
Keywords: Animals; Biological database; Evolutionary history; Ornithology; Paleobiology; Phylogenetics; Systematics
Year: 2022 PMID: 36212022 PMCID: PMC9535407 DOI: 10.1016/j.isci.2022.105101
Source DB: PubMed Journal: iScience ISSN: 2589-0042
Figure 1FuTRES data workflow
The FuTRES community collects data from a variety of sources: the field, the literature, online databases, or from museum collections. The users input data formatted to a template accessed through GEOME, which accommodates paleo-, zooarchaeo-, and neontological metadata types. FuTRES works with the user to preprocess the data, but is also building tools, such as an RShinyApp (https://github.com/futres/RShinyFuTRES), that will allow submitters to prepare their own data for GEOME. The trait terms are defined and standardized; if a term does not exist, the user can create an issue to request a term through https://github.com/futres/fovt. The data are then validated and stored in GEOME. The FuTRES workflow then converts the data into RDF triples and reasons over the ontology and terms, resulting in standardized, discoverable data. The FuTRES team provides a cleaning routine for the data, filtering data, simple metrics about data, mapping and visualization of data, and ultimately the download of data. The user then can access and discover trait data at the specimen level.
Summaries of traits ingested into FuTRES as of December 2020
| Trait (IRI) | Synonyms | Records (non-modern) | Species |
|---|---|---|---|
| body mass (OBA:VT0001259) | 196,098 | 2,357 | |
| body length with tail, | total length | 525,733 | 3,755 |
| ear length to notch (FOVT:0,000,005) | ear length | 406,953 | 2,714 |
| tail length (OBA:VT0002758) | 473,211 | 2,854 | |
| pes length (OBA:1,000,048) | hindfoot length | 469,877 | 2,789 |
| forearm length (OBA:VT0010023) | 19,346 | 614 | |
| astragalus lateral length (FOVT:0,000,013) | astragalus GLl | 767 (722) | 78 |
| astragalus breadth (FOVT:0,000,021) | astragalus width | 733 (688) | 76 |
| calcaneus length (FOVT:0,000,022) | calcaneus greatest length | 308 (289) | 48 |
| calcaneus width (FOVT:0,001,079) | 341 (311) | 51 | |
| humerus length (OBA:VT0004350) | 59 (45) | 12 | |
| tooth row length (FOVT:0,000,030) | 288 | 1 | |
Trait terms are the same as in the ontology (FOVT), with their IRI in parentheses. We also include counts for total number of records and for non-modern records. Synonyms for terms are either synonyms in the ontology or, in the case of the astragalus lateral length, the term we use in the paper to reflect terminology in von den Driesch (1976).
Figure 2Data cleaning method with example
(A–C). Here, we show Otospermophilus beecheyi as an example of the data cleaning process and success. Much data had unknown life stage (A), where purple colors denote known adults, yellow unknown life stage, and gray juveniles which we exclude from subsequent analyses. In this example, Otospermophilus beecheyi had 108 body mass records with no life stage reported. To remedy this, we created a distribution to test whether the unlabeled data were potentially adults. 1. Non-inferred, adult measurements were tested for outliers (results in B; gray bars below distributions are outliers). 2. From that set of data, we created +/−3σ upper and lower limits. 3. We tested the unlabeled, non-juvenile data against those limits (results in C; gray bars below distributions are outliers). Those within the limits we kept and labeled “possible adult; possibly good”, those outside of the limits were labeled “outliers” or “possible juvenile”.
Comparison of mean species’ mass between PanTHERIA and this study
| Group | N (%) | Within +/−3 | Outside +/−3 | >3 | < -3 |
|---|---|---|---|---|---|
| All | 773 (100%) | 244 (31.6%) | 529 (68.4%) | 422 (79.8%) | 107 (20.2%) |
| <100g | 559 (72.2%) | 171 (30.6%) | 388 (69.4%) | 301 (77.6%) | 83 (21.4%) |
| 100–1000g | 125 (16.2%) | 44 (35.2%) | 81 (64.8%) | 64 (79.0%) | 17 (21.0%) |
| 1000–10,000g | 46 (6.0%) | 18 (39.1%) | 28 (60.9%) | 26 (92.9%) | 2 (7.1%) |
| 10,000–100,000g | 31 (4.1%) | 7 (22.6%) | 24 (77.4%) | 23 (95.8%) | 1 (4.2%) |
N percentages are out of total species. Percent of species within or outside of 3 standard errors (se) are compared to the sample size (N) for that group. More often than not, species means from PanTHERIA are outside +/−3se of the means calculated in this study. When they are outside of +/−3se, PanTHERIA tends to overestimate mean body size.
Figure 3Differences between dynamic and static body mass estimates
The distribution of the number of standard errors (se) of the PanTHERIA mean body masses (indicated by the vertical hash marks along the x axis) is from FuTRES average body mass. Dotted line (dark gray) indicates the +/−3 se from FuTRES average body mass.
Figure 4Body mass estimation for zooarchaeological deer astragali
The relationship between modern deer astragali lateral length and body mass (black dots; black line) comes from data ingested to FuTRES from VertNet and K. Emery. Zooarchaeological data includes FuTRES data from K. Emery and additional data from Reitz et al. (2010). We predicted body mass (diamonds) from two sites (St. Catherines Island, 1565-1763 ACE in dark purple, and Fort Center, 200-800 ACE in light purple) and their associated +/− SE(vertical lines) from the relationship between modern deer astragali lateral length and body mass.
Constants for allometric equations for estimating the body mass of Odocoileus virginianus from astragalus lateral length measurements in FuTRES
| b( | df | p value | ||||
|---|---|---|---|---|---|---|
| This study | 1.45 (0.64) | 2.04 (0.97) | 25 | 27 | 0.29 | 0.004 |
| −6.79 | 5.29 | 10 | 0.87 |
Constants and needed information, such as SE(se) of the slope (b), intercept (log10(a)) and sample size, are needed to estimate (log10(y)), which in this case is body mass. We show our revised intercept, slope, r-squared value (R), and p value with degrees of freedom (df) for estimating body mass compared to those derived in the 1990s in the FM-EAP with a smaller sample size (unpublished data) and used in Reitz (2008).
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| FuTRES Data | Zenodo Data: | |
| Discovery Environment | CyVerse | Raw data files: |
| Code used in paper | This paper | |
| fovt-data-pipeline | This paper | |
| Rfutres | This paper | |
| Traiter | ||
| OutlierDetection | ||
| FOVT (FuTRES Ontology of Vertebrate Traits) | This paper | |