| Literature DB >> 34593819 |
Daniel Falster1, Rachael Gallagher2,3, Elizabeth H Wenk4, Ian J Wright2, Dony Indiarto4, Samuel C Andrew5, Caitlan Baxter4, James Lawson6, Stuart Allen2, Anne Fuchs7, Anna Monro7, Fonti Kar4, Mark A Adams8, Collin W Ahrens3, Matthew Alfonzetti2, Tara Angevin9, Deborah M G Apgaua10, Stefan Arndt11, Owen K Atkin12, Joe Atkinson4, Tony Auld13, Andrew Baker14, Maria von Balthazar15, Anthony Bean16, Chris J Blackman17, Keith Bloomfield18, David M J S Bowman17, Jason Bragg19, Timothy J Brodribb17, Genevieve Buckton20, Geoff Burrows21, Elizabeth Caldwell22, James Camac23, Raymond Carpenter24, Jane A Catford25, Gregory R Cawthray26, Lucas A Cernusak27, Gregory Chandler28, Alex R Chapman29, David Cheal30, Alexander W Cheesman20, Si-Chong Chen31, Brendan Choat3, Brook Clinton7, Peta L Clode26, Helen Coleman29, William K Cornwell4, Meredith Cosgrove12, Michael Crisp12, Erika Cross21, Kristine Y Crous3, Saul Cunningham32, Timothy Curran33, Ellen Curtis34, Matthew I Daws35, Jane L DeGabriel36, Matthew D Denton37, Ning Dong2, Pengzhen Du38, Honglang Duan39, David H Duncan11, Richard P Duncan40, Marco Duretto41, John M Dwyer42, Cheryl Edwards43, Manuel Esperon-Rodriguez3, John R Evans12, Susan E Everingham4, Claire Farrell11, Jennifer Firn44, Carlos Roberto Fonseca45, Ben J French17, Doug Frood46, Jennifer L Funk47, Sonya R Geange12, Oula Ghannoum3, Sean M Gleason48, Carl R Gosper49, Emma Gray2, Philip K Groom50, Saskia Grootemaat4, Caroline Gross51, Greg Guerin52, Lydia Guja7, Amy K Hahs53, Matthew Tom Harrison54, Patrick E Hayes26, Martin Henery55, Dieter Hochuli56, Jocelyn Howell57, Guomin Huang58, Lesley Hughes2, John Huisman59, Jugoslav Ilic11, Ashika Jagdish4, Daniel Jin56, Gregory Jordan17, Enrique Jurado60, John Kanowski61, Sabine Kasel11, Jürgen Kellermann62, Belinda Kenny63, Michele Kohout64, Robert M Kooyman2, Martyna M Kotowska65, Hao Ran Lai66, Etienne Laliberté67, Hans Lambers26, Byron B Lamont50, Robert Lanfear68, Frank van Langevelde69, Daniel C Laughlin70, Bree-Anne Laugier-Kitchener2, Susan Laurance20, Caroline E R Lehmann71, Andrea Leigh34, Michelle R Leishman2, Tanja Lenz2, Brendan Lepschi7, James D Lewis72, Felix Lim73, Udayangani Liu31, Janice Lord74, Christopher H Lusk75, Cate Macinnis-Ng76, Hannah McPherson41, Susana Magallón77, Anthony Manea2, Andrea López-Martinez77, Margaret Mayfield42, James K McCarthy78, Trevor Meers79, Marlien van der Merwe19, Daniel J Metcalfe5, Per Milberg80, Karel Mokany5, Angela T Moles4, Ben D Moore3, Nicholas Moore9, John W Morgan9, William Morris11, Annette Muir64, Samantha Munroe52, Áine Nicholson17, Dean Nicolle81, Adrienne B Nicotra12, Ülo Niinemets82, Tom North7, Andrew O'Reilly-Nugent40, Odhran S O'Sullivan83, Brad Oberle84, Yusuke Onoda85, Mark K J Ooi86, Colin P Osborne87, Grazyna Paczkowska29, Burak Pekin88, Caio Guilherme Pereira89, Catherine Pickering90, Melinda Pickup91, Laura J Pollock92, Pieter Poot27, Jeff R Powell3, Sally A Power3, Iain Colin Prentice18, Lynda Prior17, Suzanne M Prober5, Jennifer Read22, Victoria Reynolds42, Anna E Richards5, Ben Richardson93, Michael L Roderick12, Julieta A Rosell77, Maurizio Rossetto41, Barbara Rye93, Paul D Rymer3, Michael A Sams42, Gordon Sanson22, Hervé Sauquet41, Susanne Schmidt94, Jürg Schönenberger15, Ernst-Detlef Schulze95, Kerrie Sendall96, Steve Sinclair65, Benjamin Smith3, Renee Smith3, Fiona Soper97, Ben Sparrow52, Rachel J Standish98, Timothy L Staples42, Ruby Stephens2, Christopher Szota11, Guy Taseski4, Elizabeth Tasker13, Freya Thomas11, David T Tissue3, Mark G Tjoelker3, David Yue Phin Tng10, Félix de Tombeur99, Kyle Tomlinson100, Neil C Turner26, Erik J Veneklaas26, Susanna Venn101, Peter Vesk11, Carolyn Vlasveld22, Maria S Vorontsova31, Charles A Warren56, Nigel Warwick51, Lasantha K Weerasinghe102, Jessie Wells42, Mark Westoby2, Matthew White64, Nicholas S G Williams11, Jarrah Wills56, Peter G Wilson103, Colin Yates49, Amy E Zanne104,105, Graham Zemunik26, Kasia Ziemińska73.
Abstract
We introduce the AusTraits database - a compilation of values of plant traits for taxa in the Australian flora (hereafter AusTraits). AusTraits synthesises data on 448 traits across 28,640 taxa from field campaigns, published literature, taxonomic monographs, and individual taxon descriptions. Traits vary in scope from physiological measures of performance (e.g. photosynthetic gas exchange, water-use efficiency) to morphological attributes (e.g. leaf area, seed mass, plant height) which link to aspects of ecological variation. AusTraits contains curated and harmonised individual- and species-level measurements coupled to, where available, contextual information on site properties and experimental conditions. This article provides information on version 3.0.2 of AusTraits which contains data for 997,808 trait-by-taxon combinations. We envision AusTraits as an ongoing collaborative initiative for easily archiving and sharing trait data, which also provides a template for other national or regional initiatives globally to fill persistent gaps in trait knowledge.Entities:
Mesh:
Year: 2021 PMID: 34593819 PMCID: PMC8484355 DOI: 10.1038/s41597-021-01006-6
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Fig. 1The data curation pathway used to assemble the AusTraits database. Trait measurements are accessed from original data sources, including published floras and field campaigns. Features such as variable names, units and taxonomy are harmonised to a common standard. Versioned releases are distributed to users, allowing the dataset to be used and re-used in a reproducible way.
Main elements of the harmonised AusTraits database. See Tables 2–8 for details on each component.
| Element | Contents |
|---|---|
| traits | A table containing measurements of plant traits. |
| sites | A table containing observations of site characteristics associated with information in ‘traits’. Cross referencing between the two dataframes is possible using combinations of the variables ‘dataset_id’, ‘site_name’. |
| contexts | A table containing observations of contextual characteristics associated with information in ‘traits’. Cross referencing between the two dataframes is possible using combinations of the variables ‘dataset_id’, ‘context_name’. |
| methods | A table containing details on methods with which data were collected, including time frame and source. |
| excluded_data | A table of data that did not pass quality test and so were excluded from the master dataset. |
| taxa | A table containing details on taxa associated with information in ‘traits’. This information has been sourced from the APC (Australian Plant Census) and APNI (Australian Plant Name Index) and is released under a CC-BY3 license. |
| definitions | A copy of the definitions for all tables and terms. Information included here was used to process data and generate any documentation for the study. |
| sources | Bibtex entries for all primary and secondary sources in the compilation. |
| contributors | A table of people contributing to each study. |
| taxonomic_updates | A table of all taxonomic changes implemented in the construction of AusTraits. Changes are determined by comparing against the APC (Australian Plant Census) and APNI (Australian Plant Name Index). |
| build_info | A description of the computing environment used to create this version of the dataset, including version number, git commit and R session_info. |
Structure of the traits table, containing measurements of plant traits.
| key | value |
|---|---|
| dataset_id | Primary identifier for each study contributed into AusTraits; most often these are scientific papers, books, or online resources. By default should be name of first author and year of publication, e.g. ‘Falster_2005’. |
| taxon_name | Currently accepted name of taxon in the Australian Plant Census or in the Australian Plant Name Index. |
| site_name | Name of site where individual was sampled. Cross-references to identical columns in ‘sites’ and ‘traits’. |
| context_name | Name of contextual senario where individual was sampled. Cross-references to identical columns in ‘contexts’ and ‘traits’. |
| observation_id | A unique identifier for the observation, useful for joining traits coming from the same ‘observation_id’. These are assigned automatically, based on the ‘dataset_id’ and row number of the raw data. |
| trait_name | Name of trait sampled. |
| value | Measured value. |
| unit | Units of the sampled trait value after aligning with AusTraits standards. |
| date | Date sample was taken, in the format ‘yyyy-mm-dd’, but with days and months only when specified. |
| value_type | A categorical variable describing the type of trait value recorded. |
| replicates | Number of replicate measurements that comprise the data points for the trait for each measurement. A numeric value (or range) is ideal and appropriate if the value type is a ‘mean’, ‘median’, ‘min’ or ‘max’. For these value types, if replication is unknown the entry should be ‘unknown’. If the value type is ‘raw_value’ the replicate value should be 1. If the value type is ‘expert_mean’, ‘expert_min’, or ‘expert_max’ the replicate value should be ‘na’. |
| original_name | Name given to taxon in the original data supplied by the authors |
Structure of the contributors table, of people contributing to each study.
| key | value |
|---|---|
| dataset_id | Primary identifier for each study contributed into AusTraits; most often these are scientific papers, books, or online resources. By default should be name of first author and year of publication, e.g. ‘Falster_2005’. |
| name | Name of contributor |
| institution | Last known institution or affiliation |
| role | Their role in the study |
Structure of the sites table, containing observations of site characteristics associated with information in traits.
| key | value |
|---|---|
| dataset_id | Primary identifier for each study contributed into AusTraits; most often these are scientific papers, books, or online resources. By default should be name of first author and year of publication, e.g. ‘Falster_2005’. |
| site_name | Name of site where individual was sampled. Cross-references to identical columns in ‘sites’ and ‘traits’. |
| site_property | The site characteristic being recorded. Name should include units of measurement, e.g. ‘longitude (deg)’. Ideally we have at least these variables for each site - ‘longitude (deg)’, ‘latitude (deg)’, ‘description’. |
| value | Measured value. |
Possible value types of trait records.
| key | value |
|---|---|
| raw_value | Value is a direct measurement |
| site_min | Value is the minimum of measurements on multiple individuals of the taxon at a single site |
| site_mean | Value is the mean or median of measurements on multiple individuals of the taxon at a single site |
| site_max | Value is the maximum of measurements on multiple individuals of the taxon at a single site |
| multisite_min | Value is the minimum of measurements on multiple individuals of the taxon across multiple sites |
| multisite_mean | Value is the mean or median of measurements on multiple individuals of the taxon across multiple sites |
| multisite_max | Value is the maximum of measurements on multiple individuals of the taxon across multiple sites |
| expert_min | Value is the minimum observed for a taxon across its range or in this particular dataset, as estimated by an expert based on their knowledge of the taxon. Data fitting this category include estimates from floras that represent a taxon’s entire range. |
| expert_mean | Value is the mean observed for a taxon across its range or in this particular dataset, as estimated by an expert based on their knowledge of the taxon. Data fitting this category include estimates from floras that represent a taxon’s entire range, and values for categorical variables obtained from a reference book, or identified by an expert. |
| expert_max | Value is the maximum observed for a taxon across its range or in this particular dataset, as estimated by an expert based on their knowledge of the taxon. Data fitting this category include estimates from floras that represent a taxon’s entire range. |
| experiment_min | Value is the minimum of measurements from an experimental study either in the field or a glasshouse |
| experiment_mean | Value is the mean or median of measurements from an experimental study either in the field or a glasshouse |
| experiment_max | Value is the maximum of measurements from an experimental study either in the field or a glasshouse |
| individual_mean | Value is a mean of replicate measurements on an individual (usually for experimental ecophysiology studies) |
| individual_max | Value is a maximum of replicate measurements on an individual (usually for experimental ecophysiology studies) |
| literature_source | Value is a site or multi-site mean that has been sourced from an unknown literature source |
| unknown | Value type is not currently known |
Structure of the methods table, containing details on methods with which data were collected, including time frame and source.
| key | value |
|---|---|
| dataset_id | Primary identifier for each study contributed into AusTraits; most often these are scientific papers, books, or online resources. By default should be name of first author and year of publication, e.g. ‘Falster_2005’. |
| trait_name | Name of trait sampled. Allowable values specified in the table ‘traits’. |
| methods | A textual description of the methods used to collect the trait data. Whenever available, methods are taken near-verbatim from referenced source. Methods can include descriptions such as ‘measured on botanical collections’, ‘data from the literature’, or a detailed description of the field or lab methods used to collect the data. |
| year_collected_start | The year data collection commenced. |
| year_collected_end | The year data collection was completed. |
| description | A 1–2 sentence description of the purpose of the study. |
| collection_type | A field to indicate where the majority of plants on which traits were measured were collected - in the ‘field’, ‘lab’, ‘glasshouse’, ‘botanical collection’, or ‘literature’. The latter should only be used when the data were sourced from the literature and the collection type is unknown. |
| sample_age_class | A field to indicate if the study was completed on ‘adult’ or ‘juvenile’ plants. |
| sampling_strategy | A written description of how study sites were selected and how study individuals were selected. When available, this information is copied verbatim from a published manuscript. For botanical collections, this field ideally indicates which records were ‘sampled’ to measure a specific trait. |
| source_primary_citation | Citation for primary source. This detail is generated from the primary source in the metadata. |
| source_primary_key | Citation key for primary source in ‘sources’. The key is typically of format ‘Surname_year’. |
| source_secondary_citation | Citations for secondary source. This detail is generated from the secondary source in the metadata. |
| source_secondary_key | Citation key for secondary source in ‘sources’. The key is typically of format ‘Surname_year’. |
Structure of the taxonomic_updates table, of all taxonomic changes implemented in the construction of AusTraits. Changes are determined by comparing against the APC (Australian Plant Census) and APNI (Australian Plant Name Index).
| key | value |
|---|---|
| dataset_id | Primary identifier for each study contributed into AusTraits; most often these are scientific papers, books, or online resources. By default should be name of first author and year of publication, e.g. ‘Falster_2005’. |
| original_name | Name given to taxon in the original data supplied by the authors |
| cleaned_name | Name of the taxon after implementing any changes encoded for this taxon in the metadata file for the correpsonding ‘dataset_id’. |
| taxonIDClean | Where it could be identified, the ‘taxonID’ of the ‘cleaned_name’ for this taxon in the APC. |
| taxonomicStatusClean | Taxonomic status of the taxon identified by ‘taxonIDClean’ in the APC. |
| alternativeTaxonomicStatusClean | The status of alternative records with the name ‘cleaned_name’ in the APC. |
| acceptedNameUsageID | ID of the accepted name for taxon in the APC or APNI. |
| taxon_name | Currently accepted name of taxon in the APC or in the APNI . |
Structure of the taxa table, containing details on taxa associated with information in the traits table. This information has been sourced from the APC (Australian Plant Census) and APNI (Australian Plant Name Index) and is released under a CC-BY3 license.
| key | value |
|---|---|
| taxon_name | Currently accepted name of taxon in the APC or in the APNI . |
| source | Source of taxnonomic information, either APC or APNI. |
| acceptedNameUsageID | ID of the accepted name for taxon in the APC or APNI. |
| scientificNameAuthorship | Authority for taxon indicated under taxon_name. |
| taxonRank | Rank of the taxon. |
| taxonomicStatus | Taxonomic status of the taxon. |
| family | Family of the taxon. |
| genus | Genus of the taxon. |
| taxonDistribution | Known distribution of the taxon, by state. |
| ccAttributionIRI | Source of taxonomic information. |
Fig. 2Coverage of traits by taxa. (a) Matrix showing the coverage of taxa for each trait, with yellow indicating presence of data. The figure was generated with a subset of 500 randomly selected taxa. (b) Number of taxa with data for first 100 traits for all traits and separated by tissue.
Fig. 3Number of taxa with trait records by plant tissue and trait category, for data that are (a) Geo-referenced, and (b) Not geo-referenced. Many records without a geo-reference come from botanical collections, such as floras.
Fig. 4Coverage of geo-referenced trait records across Australian climatic and geographic space for traits in different categories. (a) AusTraits’ sites (orange) within Australia’s precipitation-temperature space (dark-grey) superimposed upon Whittaker’s classification of major biomes by climate[370]. Climate data were extracted at 10" resolution from WorldClim[371]. (b) Locations of geo-referenced records for different plant tissues.
Fig. 5Phylogenetic distribution of trait data in AusTraits for a subset of 2000 randomly sampled taxa. The heatmap colour intensity denotes the number of traits measured within a family for each plant tissue. The most widespread family names (with more than ten taxa) are labelled on the edge of the tree.
Fig. 6The number of taxa with trait records in AusTraits and global TRY database (accessed 28 May 2020). Each point shows a separate trait.
| Measurement(s) | plant trait |
| Technology Type(s) | digital curation |
| Sample Characteristic - Organism | Viridiplantae |
| Sample Characteristic - Location | Australia |
Structure of the contexts table, containing observations of contextual characteristics associated with information in traits.
| key | value |
|---|---|
| dataset_id | Primary identifier for each study contributed into AusTraits; most often these are scientific papers, books, or online resources. By default should be name of first author and year of publication, e.g. ‘Falster_2005’. |
| context_name | Name of contextual senario where individual was sampled. Cross-references to identical columns in ‘contexts’ and ‘traits’. |
| context_property | The contextual characteristic being recorded. Name should include units of measurement, e.g. ‘CO2 concentration (ppm)’. |
| value | Measured value. |