| Literature DB >> 17987112 |
Chris Yesson1, Peter W Brewer, Tim Sutton, Neil Caithness, Jaspreet S Pahwa, Mikhaila Burgess, W Alec Gray, Richard J White, Andrew C Jones, Frank A Bisby, Alastair Culham.
Abstract
There is a concerted global effort to digitize biodiversity occurrence data from herbarium and museum collections that together offer an unparalleled archive of life on Earth over the past few centuries. The Global Biodiversity Information Facility provides the largest single gateway to these data. Since 2004 it has provided a single point of access to specimen data from databases of biological surveys and collections. Biologists now have rapid access to more than 120 million observations, for use in many biological analyses. We investigate the quality and coverage of data digitally available, from the perspective of a biologist seeking distribution data for spatial analysis on a global scale. We present an example of automatic verification of geographic data using distributions from the International Legume Database and Information Service to test empirically, issues of geographic coverage and accuracy. There are over 1/2 million records covering 31% of all Legume species, and 84% of these records pass geographic validation. These data are not yet a global biodiversity resource for all species, or all countries. A user will encounter many biases and gaps in these data which should be understood before data are used or analyzed. The data are notably deficient in many of the world's biodiversity hotspots. The deficiencies in data coverage can be resolved by an increased application of resources to digitize and publish data throughout these most diverse regions. But in the push to provide ever more data online, we should not forget that consistent data quality is of paramount importance if the data are to be useful in capturing a meaningful picture of life on Earth.Entities:
Mesh:
Year: 2007 PMID: 17987112 PMCID: PMC2043490 DOI: 10.1371/journal.pone.0001124
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1All valid points collected from GBIF database
Figure 2GBIF points classified ‘In the sea’
Figure 3GBIF points classified ‘Lat/Long error’
Figure 4GBIF points classified ‘Lat/Long exactly zero’
Figure 5GBIF points classified ‘Near valid’
Figure 6GBIF points classified ‘Far from valid’
Top GBIF data providers for Legume data. Note: the species count is not cumulative as species data can be from more than one provider.
| Country-Provider | verified records (rank) | % total | % records verified | valid species | % total species |
| UK-National Biodiversity Network | 314,959 | 59.1% | 83.0% | 110 | 2.0% |
| Germany-Bundesamt für Naturschutz | 83,943 | 15.7% | 95.3% | 73 | 1.3% |
| Australia-National Herbarium of New South Wales | 24,950 | 4.7% | 94.3% | 1,140 | 21.0% |
| Australia-Centre for Plant Biodiversity Research | 20,361 | 3.8% | 87.4% | 1,604 | 29.6% |
| USA-Missouri Botanic Gardens | 20,174 | 3.8% | 68.2% | 2,562 | 47.2% |
| Australia-National Botanic Garden | 10,075 | 1.9% | 92.2% | 1,213 | 22.4% |
| Sweden-Lund Botanical Museum | 6,845 | 1.3% | 74.9% | 278 | 5.1% |
| UK-Environment and Heritage Service | 4,868 | 0.9% | 53.5% | 25 | 0.5% |
| USA-Arizona State University | 3,479 | 0.7% | 94.3% | 178 | 3.3% |
| Costa Rica-Instituto Nacional de Biodiversidad | 3,176 | 0.6% | 77.8% | 170 | 3.1% |
| Sweden-GBIF-SE:ArtDatabanken | 3,150 | 0.6% | 87.3% | 60 | 1.1% |
| All Others | 37,046 | 7.0% | 92.4% | - | - |
|
|
|
|
|
|
|
Figure 7The top 10 data suppliers of Legume records
Figure 8The top 10 data suppliers of Legume species
Figure 9Global Legume coverage from GBIF data per TDWG level 4 area.
Species coverage of GBIF records using ILDIS to define total species numbers
| TDWG Level 4 Name | Country | GBIF Records | GBIF Species | ILDIS Species | % Coverage |
| Sweden | Sweden | 5,897 | 69 | 100 | 69% |
| East Aegean Is. | Greece | 1,403 | 140 | 224 | 63% |
| Great Britain | United Kingdom | 319,915 | 110 | 183 | 60% |
| Spain | Spain | 12,857 | 366 | 623 | 59% |
| Costa Rica | Costa Rica | 6,013 | 270 | 479 | 56% |
| Greece | Greece | 5,031 | 259 | 491 | 53% |
| Ecuador | Ecuador | 3,339 | 201 | 383 | 52% |
| Nicaragua | Nicaragua | 2,545 | 136 | 262 | 52% |
| New South Wales | Australia | 20,776 | 493 | 964 | 51% |
| Western Australia | Australia | 11,855 | 873 | 1734 | 50% |
| Cocos I. | Costa Rica | 1 | 1 | 2 | 50% |
| Germany | Germany | 82,831 | 87 | 182 | 48% |
| Kriti | Greece | 454 | 112 | 237 | 47% |
| Queensland | Australia | 8,062 | 536 | 1170 | 46% |
| French Guyana | French Guyana | 2,401 | 188 | 417 | 45% |
| Alaska | United States | 1,126 | 27 | 62 | 44% |
| Iceland | Iceland | 76 | 8 | 19 | 42% |
| Tasmania | Australia | 451 | 50 | 120 | 42% |
| Senegal | Senegal | 440 | 123 | 298 | 41% |
| Northern Territory | Australia | 6,244 | 342 | 848 | 40% |
|
|
|
|
|
|
GBIF data for global the hotspots of Mittermeier et al. [19]
| Hotspot Name* | GBIF records (rank) | GBIF Species | % Total | % Total |
| Mediterranean Basin | 18,156 | 515 | 3.4% | 9.5% |
| Mesoamerica | 9,714 | 595 | 1.8% | 11.0% |
| Southwest Australia | 7,499 | 598 | 1.4% | 11.0% |
| Tropical Andes | 2,918 | 472 | 0.5% | 8.7% |
| Madrean Pine-Oak Woodlands | 1,413 | 246 | 0.3% | 4.5% |
| Tumbes-Choco-Magdalena | 1,259 | 224 | 0.2% | 4.1% |
| New Zealand | 1,023 | 22 | 0.2% | 0.4% |
| Madagascar and the Indian Ocean Islands | 762 | 217 | 0.1% | 4.0% |
| Eastern Afromontane | 567 | 265 | 0.1% | 4.9% |
| Atlantic Forest | 487 | 150 | 0.1% | 2.8% |
| Guinean Forests of West Africa | 303 | 136 | 0.1% | 2.5% |
| Cerrado | 240 | 117 | 0.0% | 2.2% |
| Coastal Forests of Eastern Africa | 222 | 118 | 0.0% | 2.2% |
| Indo-Burma | 157 | 106 | 0.0% | 2.0% |
| California Floristic Province | 115 | 67 | 0.0% | 1.2% |
| East Melanesian Islands | 104 | 37 | 0.0% | 0.7% |
| Polynesia-Micronesia | 87 | 25 | 0.0% | 0.5% |
| Sundaland | 69 | 40 | 0.0% | 0.7% |
| Caribbean Islands | 45 | 35 | 0.0% | 0.6% |
| Caucasus | 44 | 37 | 0.0% | 0.7% |
| Cape Floristic Region | 32 | 25 | 0.0% | 0.5% |
| Irano-Anatolian | 32 | 24 | 0.0% | 0.4% |
| Wallacea | 32 | 12 | 0.0% | 0.2% |
| Horn of Africa | 29 | 14 | 0.0% | 0.3% |
| Chilean Winter Rainfall and Valdivian Forests | 22 | 13 | 0.0% | 0.2% |
| Philippines | 15 | 14 | 0.0% | 0.3% |
| Mountains of Southwest China | 11 | 8 | 0.0% | 0.1% |
| New Caledonia | 11 | 4 | 0.0% | 0.1% |
| Japan | 5 | 1 | 0.0% | 0.0% |
| Mountains of Central Asia | 5 | 4 | 0.0% | 0.1% |
| Succulent Karoo | 3 | 2 | 0.0% | 0.0% |
| Western Ghats and Sri Lanka | 3 | 3 | 0.0% | 0.1% |
| Himalaya | 1 | 1 | 0.0% | 0.0% |
| Maputaland-Pondoland-Albany | 1 | 1 | 0.0% | 0.0% |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Figure 10Species coverage on GBIF at a continental scale (TDWG level 1 continents).
Figure 11Hypothetical example of a point near a coastline which cross the land/sea barrier when referenced on grids of ¼, ½, and 1 degree resolution.
Co-ordinates are displayed by points.
Large herbaria and their contribution to GBIF at the time of this analysis.
| Code | Name of Herbarium | Country | Specimens | GBIF totals # | % Total |
| P | Muséum National d'Histoire Naturelle, Paris | France | 7,500,000 | 448,437 | 6% |
| K | Royal Botanic Gardens, Kew | UK | 7,000,000 | 0 | 0% |
| NY | New York Botanical Garden | USA | 7,000,000 | 91,037 | 1% |
| G | Conservatoire et Jardin botaniques de la Ville de Genève | Switzerland | 6,000,000 | 202,855 | 3% |
| LE | V. L. Komarov Botanical Institute, St. Petersburg | Russia | 5,770,000 | 0 | 0% |
| MO | Missouri Botanical Garden | USA | 5,522,000 | 1,966,000 | 36% |
| BM | Natural History Museum, London | UK | 5,200,000 | 232,418 | 4% |
| GH | Harvard University, Massachusetts | USA | 5,005,000 | 0 | 0% |
| S | Swedish Museum of Natural History, Stockholm | Sweden | 4,400,000 | 617,047 | 14% |
| US | Smithsonian Institution, Washington DC | USA | 4,340,000 | 0 | 0% |
| MPU | Université Montpellier | France | 4,000,000 | 0 | 0% |
|
|
|
|
|
Source: Index Herbariorum http://sciweb.nybg.org/science2/IndexHerbariorum.asp Accessed 10/2005. # Source: http://www.gbif.org Accessed 10/2005. Note: K now has c. 140,000 records, GH has c.220,000 records, and US has c.766,000 records on GBIF (09/2007), some other institutions have increased their online records substantially during the past 24 months.