| Literature DB >> 26967999 |
B Eugene Smith1, Mark K Johnston2, Robert Lücking1,3.
Abstract
Accuracy of taxonomic identifications is crucial to data quality in online repositories of species occurrence data, such as the Global Biodiversity Information Facility (GBIF), which have accumulated several hundred million records over the past 15 years. These data serve as basis for large scale analyses of macroecological and biogeographic patterns and to document environmental changes over time. However, taxonomic identifications are often unreliable, especially for non-vascular plants and fungi including lichens, which may lack critical revisions of voucher specimens. Due to the scale of the problem, restudy of millions of collections is unrealistic and other strategies are needed. Here we propose to use verified, georeferenced occurrence data of a given species to apply predictive niche modeling that can then be used to evaluate unverified occurrences of that species. Selecting the charismatic lichen fungus, Usnea longissima, as a case study, we used georeferenced occurrence records based on sequenced specimens to model its predicted niche. Our results suggest that the target species is largely restricted to a narrow range of boreal and temperate forest in the Northern Hemisphere and that occurrence records in GBIF from tropical regions and the Southern Hemisphere do not represent this taxon, a prediction tested by comparison with taxonomic revisions of Usnea for these regions. As a novel approach, we employed Principal Component Analysis on the environmental grid data used for predictive modeling to visualize potential ecogeographical barriers for the target species; we found that tropical regions conform a strong barrier, explaining why potential niches in the Southern Hemisphere were not colonized by Usnea longissima and instead by morphologically similar species. This approach is an example of how data from two of the most important biodiversity repositories, GenBank and GBIF, can be effectively combined to remotely address the problem of inaccuracy of taxonomic identifications in occurrence data repositories and to provide a filtering mechanism which can considerably reduce the number of voucher specimens that need critical revision, in this case from 4,672 to about 100.Entities:
Mesh:
Year: 2016 PMID: 26967999 PMCID: PMC4788202 DOI: 10.1371/journal.pone.0151232
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Maximum likelihood tree of Usnea longissima haplotypes using the fungal ITS barcoding marker.
Bootstrap values are given for supported branches (> 70). Black dots indicate Asian voucher samples (all others North America and one from Europe). Scale bar indicates rate of changes per site. The ingroup sequences of the JX978-series are individual haplotypes representing a total of 1477 sequenced specimens, with each sequence representing a selected specimen corresponding to that particular haplotype.
Fig 2Best fitting MexEnt model for Usnea longissima based on 1477 sequenced samples corresponding to 160 localities from Rolstad et al. (2013), indicated by shaded areas ranging from pale blue-green to red.
Bright blue areas indicate range of predicted absence. Map is overlayed by occurrence records from GBIF (white dots), and those falling outside the predicted range are marked with red circles. One dot may include more than one GBIF record (S1 Table).
Factor loadings of the environmental variables used in the PCA and total variance explained.
High loadings of >0.70 are highlighted in boldface and marked with an asterisk.
| Variable | Factor 1 | Factor 2 |
|---|---|---|
| tree_cover | 0.143366 | 0.441376 |
| alta | 0.155203 | -0.005793 |
| bio_1a (annual mean temperature) | 0.333335 | |
| bio_2a (mean diurnal temperature range) | -0.333610 | |
| bio_3a (isothermality) | 0.047748 | |
| bio_4a (temperature seasonality) | 0.000346 | |
| bio_5a (maximum temperature warmest month) | 0.556718 | |
| bio_6a (minimum temperature coldest month) | 0.175193 | |
| bio_7a (temperature annual range) | 0.163407 | |
| bio_8a (mean temperature wettest quarter) | 0.385510 | |
| bio_9a (mean temperature driest quarter) | 0.264902 | |
| bio_10a (mean temperature warmest quarter) | 0.489969 | |
| bio_11a (mean temperature coldest quarter) | 0.229091 | |
| bio_12a (annual precipitation) | -0.666655 | |
| bio_13a (precipitation wettest month) | -0.475273 | |
| bio_14a (precipitation driest month) | -0.292179 | |
| bio_15a (precipitation seasonality) | -0.256584 | 0.603328 |
| bio_16a (precipitation wettest quarter) | -0.699481 | -0.512586 |
| bio_17a (precipitation driest quarter) | -0.327346 | |
| bio_18a (precipitation warmest quarter) | -0.465097 | -0.614460 |
| bio_19a (precipitation coldest quarter) | -0.483883 | -0.624403 |
| Explained variance | 9.840462 | 5.099666 |
| Proportion of total | 46.8593% | 24.2841% |
Fig 3Global mapping of absolute distance scores derived from the first axis of a PCA ordination of environmental grid parameters used for the predictive niche modeling.
Distances were computed from an optimal environmental parameter set defined by the highest AUC values for grids with predicted distribution of Usnea longissima. Blue areas indicate zero or short ecological distances from the optimal grid whereas red areas indicate far distances (ecogeographical barriers). The tropics emerge as a strong barrier for the north-south distribution of the species.
GenBank Accession numbers and voucher information for specimens of Usnea longissima used in the phylogenetic and predictice modeling analysis.
| Genus | Species | GB Accession | Country | Collector | Number |
|---|---|---|---|---|---|
| AB051665 | Japan | Ohmura | 2911 | ||
| JX978183 | Canada | Rolstad et al. | U0001 | ||
| JX978184 | Canada | Rolstad et al. | U0002 | ||
| JX978185 | Canada | Rolstad et al. | U0006 | ||
| JX978188 | Canada | Rolstad et al. | U0039 | ||
| JX978189 | Canada | Rolstad et al. | U0056 | ||
| JX978190 | Canada | Rolstad et al. | U0100 | ||
| JX978191 | Canada | Rolstad et al. | U0116 | ||
| JX978192 | Canada | Rolstad et al. | U0170 | ||
| JX978201 | Canada | Rolstad et al. | U0594 | ||
| JX978210 | Canada | Rolstad et al. | U0918 | ||
| KF461130 | Canada | McMullin | sn | ||
| AJ748109 | Canada | KL | 68 | ||
| JX978186 | USA | Rolstad et al. | U0015 | ||
| JX978187 | USA | Rolstad et al. | U0035 | ||
| JX978193 | USA | Rolstad et al. | U0366 | ||
| JX978194 | USA | Rolstad et al. | U0426 | ||
| JX978195 | USA | Rolstad et al. | U0437 | ||
| JX978197 | USA | Rolstad et al. | U0482 | ||
| JX978198 | USA | Rolstad et al. | U0487 | ||
| JX978199 | USA | Rolstad et al. | U0551 | ||
| JX978200 | USA | Rolstad et al. | U0560 | ||
| JX978202 | USA | Rolstad et al. | U0601 | ||
| JX978203 | USA | Rolstad et al. | U0657 | ||
| JX978204 | USA | Rolstad et al. | U0708 | ||
| JX978205 | USA | Rolstad et al. | U0737 | ||
| JX978206 | USA | Rolstad et al. | U0742 | ||
| JX978207 | USA | Rolstad et al. | U0776 | ||
| JX978208 | USA | Rolstad et al. | U0783 | ||
| JX978209 | USA | Rolstad et al. | U0841 | ||
| JX978211 | USA | Rolstad et al. | U1009 | ||
| JX978212 | USA | Rolstad et al. | U1086 | ||
| JX978213 | USA | Rolstad et al. | U1590 | ||
| JX978214 | USA | Rolstad et al. | U1592 | ||
| JX978196 | Sweden | Rolstad et al. | U0456 | ||
| AJ748108 | India | KL | 88 | ||
| DQ383647 | SouthKorea | Hur | CH050148 | ||
| DQ001304 | SouthKorea | Hur | 040001 | ||
| AB051642 | Japan | Ohmura | 2877 | ||
| AB051643 | Japan | Ohmura | 2881 | ||
| AB051644 | Japan | Ohmura | 3250 | ||
| AB051645 | Japan | Ohmura | 3664 | ||
| AB051646 | Japan | Ohmura | 3816A | ||
| AB051647 | Japan | Ohmura | 3816B | ||
| AB051648 | Japan | Ohmura | 3844 | ||
| FJ494936 | Taiwan | Shen | L00004685 |