| Literature DB >> 35794183 |
Vaughn Shirey1, Elise Larsen1, Andra Doherty1, Clifford A Kim1, Faisal T Al-Sulaiman1, Jomar D Hinolan2, Micael Gabriel A Itliong2, Mark Arcebal K Naive3,4,5, Minji Ku6, Michael Belitz6, Grace Jeschke1, Vijay Barve6,7, Gerardo Lamas8, Akito Y Kawahara6, Robert Guralnick6, Naomi E Pierce9, David J Lohman2,10,11, Leslie Ries12.
Abstract
Here, we present the largest, global dataset of Lepidopteran traits, focusing initially on butterflies (ca. 12,500 species records). These traits are derived from field guides, taxonomic treatments, and other literature resources. We present traits on wing size, phenology,voltinism, diapause/overwintering stage, hostplant associations, and habitat affinities (canopy, edge, moisture, and disturbance). This dataset will facilitate comparative research on butterfly ecology and evolution and our goal is to inspire future research collaboration and the continued development of this dataset.Entities:
Mesh:
Year: 2022 PMID: 35794183 PMCID: PMC9259668 DOI: 10.1038/s41597-022-01473-5
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 8.501
Fig. 2Geographic breadth of our butterfly trait resources. Using a global map of level-two regions (World Geographic Scheme, Brummitt 2001), we have indicated the total number of resources available within each geographic area). Grey areas indicate that no resources were extracted from that region.
The total number of species represented by each trait in LepTraits 1.0.
| Number of Species Records | |
|---|---|
| Wingspan (cm) | 8,417 |
| Forewing Length (cm) | 895 |
| Phenology (all traits) | 6,518 |
| Voltinism | 3,131 |
| Diapause Stage | 859 |
| Canopy Affinity | 7,931 |
| Edge Affinity | 4,406 |
| Moisture Affinity | 4,842 |
| Disturbance Affinity | 2,868 |
| Host Plants (all traits) | 5,016 |
| Oviposition Style | 1,685 |
Fig. 1A graphical illustration of the processing workflow used to compile, scan, digitize, extract, atomize, and compile species trait records from literature resources. (1) Literature resources were examined for potential trait data and compiled into a single library; (2) each literature resource was scanned into.pdf format so that text could be readily copy and pasted from species accounts; (3) each.pdf file was uploaded to an online database with associated metadata for each literature resource; (4) trait extractors utilized an online interface to extract verbatim, raw text from designated resources; (5) verbatim, raw text extracts were either automatically (via regular-expressions and keyword searches) or manually atomized to a controlled vocabulary; (6) species consensus traits were calculated by aggregating resource-level records by name-normalized taxonomy. Rulesets were used for consensus trait building and are detailed in the supplementary material. Both resource-level and species consensus traits are presented in the dataset.
Fig. 3Geographic breadth of our butterfly trait records. Using a global map of level-two regions(World Geographic Scheme, Brummitt 2001), we have indicated the total number of trait records from each geographic region). Grey areas indicate that trait records were not extracted from that region.
The number of species represented within each family in LepTraits 1.0.
| Taxonomic Family | Number of Species Records |
|---|---|
| Hesperiidae | 2,754 |
| Lycaenidae | 3,395 |
| Nymphalidae | 3,978 |
| Papilionidae | 532 |
| Pieridae | 733 |
| Riodinidae | 1,056 |
| TOTAL | 12,448 |
| Measurement(s) | Wingspan • Habitat Affinity • oviposition • voltinism • phenology • hostplant association |
| Technology Type(s) | natural language processing |
| Sample Characteristic - Organism | Lepidoptera |
| Sample Characteristic - Location | Global |