| Literature DB >> 27481789 |
John La Salle1, Kristen J Williams2, Craig Moritz3.
Abstract
This paper explores what the virtual biodiversity e-infrastructure will look like as it takes advantage of advances in 'Big Data' biodiversity informatics and e-research infrastructure, which allow integration of various taxon-level data types (genome, morphology, distribution and species interactions) within a phylogenetic and environmental framework. By overcoming the data scaling problem in ecology, this integrative framework will provide richer information and fast learning to enable a deeper understanding of biodiversity evolution and dynamics in a rapidly changing world. The Atlas of Living Australia is used as one example of the advantages of progressing towards this future. Living in this future will require the adoption of new ways of integrating scientific knowledge into societal decision making.This article is part of the themed issue 'From DNA barcodes to biomes'.Entities:
Keywords: biodiversity informatics; biogeography; e-research infrastructure; environment; evolution
Mesh:
Year: 2016 PMID: 27481789 PMCID: PMC4971189 DOI: 10.1098/rstb.2015.0337
Source DB: PubMed Journal: Philos Trans R Soc Lond B Biol Sci ISSN: 0962-8436 Impact factor: 6.237
Core principles to support e-research infrastructure for biodiversity knowledge generation.
| type | statement of intent |
|---|---|
| collaboration | we must develop an inclusive model for participation by all stakeholders, from local to national levels, in biodiversity information |
| sharing | we must adopt procedures to prevent duplication of effort, build on past investments and create shared efficiencies to the greater benefit of all |
| science | we must organize data to provide the best possible sustainable support for excellent, independent research, now and in the future |
| learning | we must enable novel or alternative approaches to new knowledge generation to be explored |
| integration | we must be able to bring different types of data into a shared environment |
| quality | we must enable users to understand the level of evidence and authority for all data elements and have services to help improve data quality at source |
| open access | we must promote and facilitate free and open use of data—and infrastructure |
| acknowledgement | we must create an environment where individual and collective endeavours can be recognized and built upon |
| delivery | we must provide comprehensive, stable, authoritative services that meet the needs of stakeholder groups |
| innovation | we must establish a model for continuous modernization and improvement of services. Open infrastructure will support innovative new uses of infrastructure and data |
| collect data once—make it freely accessible—use it many times | |
Figure 1.The ALA's phylogenetics tool integrates phylogenetic trees and spatial mapping so that phylogenies can be represented spatially, for example by species occurrence or character. Here, the occurrence of Acacia species from the clade highlighted by the blue node to the left is mapped and coloured by species.
Figure 2.The ALA scatterplot analysis maps distribution points (right) in two-dimensional environmental space; here, we show a grid of rainfall versus temperature (left). Placing the small box around the ‘hottest, driest’ points on the left produces the red circles for those points on the distribution map (for advanced examples see http://www.ala.org.au/spatial-portal-help/scatterplot/). The ‘cool, wet’ outliers on the plot are spurious locations in eastern Australia where the species does not occur naturally.
Figure 3.The ALA classify tool enables a selection of (ideally) relatively uncorrelated environmental layers for a predefined area to be classified into characteristic domains for a given number of groups, shown here for Tasmania—a large continental island off south-eastern Australia (image credit: http://www.ala.org.au/spatial-portal-help/classify/). The classification uses the ALOC algorithm [66].