| Literature DB >> 29732259 |
Christophe Botella1,2,3,4, Alexis Joly1, Pierre Bonnet3,5, Pascal Monestiez4, François Munoz6.
Abstract
PREMISE OF THE STUDY: A species distribution model computed with automatically identified plant observations was developed and evaluated to contribute to future ecological studies.Entities:
Keywords: automated species identification; citizen science; crowdsourcing; deep learning; invasive alien species; species distribution modeling
Year: 2018 PMID: 29732259 PMCID: PMC5851560 DOI: 10.1002/aps3.1029
Source DB: PubMed Journal: Appl Plant Sci ISSN: 2168-0450 Impact factor: 1.936
Figure 1Four unvalidated Pl@ntNet plant pictures representing, or identified as, Acer monspessulanum and their respective predicted confidence values for the highest ranked species (the sum of scores over all species is always 100). (A) The species is A. monspessulanum and is well predicted. (B) The species is A. monspessulanum, but the model confounds it with A. campestre. (C) The species is A. monspessulanum or A. pseudoplatanus, but the species cannot be determined with the fruit only; there is an intrinsic taxonomic uncertainty. (D) The species is Hedera helix but is predicted as A. monspessulanum because this leaf is quite similar, as one can compare with (A).
Figure 2The number of Pl@ntNet observations per species and per confidence values p(kmax|x).
Detailed number of occurrences in the Inventaire National du Patrimoine Naturel (INPN) data set by species
| Species name | No. of observations | No. of 100‐km2 areas |
|---|---|---|
|
| 5217 | 904 |
|
| 484 | 114 |
|
| 711 | 306 |
|
| 120 | 44 |
|
| 14,278 | 2623 |
List and details of the environmental descriptors used in this study
| Name | Description | Nature | Values | Local image |
|---|---|---|---|---|
| CHBIO_2 | Mean monthly temp (max, min) | quanti. | [7.8, 21.0] | Yes |
| CHBIO_7 | Temp. annual range | quanti. | [16.7, 42.0] | Yes |
| CHBIO_8 | Mean temp. of wettest quarter | quanti. | [−14.2, 23.0] | Yes |
| CHBIO_9 | Mean temp. of driest quarter | quanti. | [−17.7, 26.5] | Yes |
| CHBIO_10 | Mean temp. of warmest quarter | quanti. | [−2.8, 26.5] | Yes |
| CHBIO_11 | Mean temp. of coldest quarter | quanti. | [−17.7, 11.8] | Yes |
| CHBIO_13 | Precip. of wettest month | quanti. | [43.0, 285.5] | Yes |
| CHBIO_14 | Precip. of driest month | quanti. | [3.0, 135.6] | Yes |
| CHBIO_15 | Precip. seasonality (CV) | quanti. | [8.2, 26.5] | Yes |
| CHBIO_18 | Precip. of warmest quarter | quanti. | [19.8, 851.7] | Yes |
| CHBIO_19 | Precip. of coldest quarter | quanti. | [60.5, 520.4] | Yes |
| etp | Potential evapotranspiration | quanti. | [133, 1176] | Yes |
| alti | Elevation | quanti. | [−188, 4672] | Yes |
| shade | Shade level | quanti. | [0, 1] | No |
| slope | Ground slope | quanti. | [0, 13457] | No |
| dmer | Distance to coastline | quanti. | [|0, 32767|] | No |
| droute | Distance to roads | quanti. | [|0, 32767|] | No |
| proxi_eau | <50 m to fresh water | bool. | {0, 1} | Yes |
| awc_top | Topsoil available water capacity | ordinal | {0, 120, 165, 210} | Yes |
| bs_top | Base saturation of the topsoil | ordinal | {35, 62, 85} | Yes |
| cec_top | Topsoil cation exchange capacity | ordinal | {7, 22, 50} | Yes |
| crusting | Soil crusting class | ordinal | [|0, 5|] | Yes |
| dgh | Depth to a gleyed horizon | ordinal | {20, 60, 140} | Yes |
| dimp | Depth to an impermeable layer | ordinal | {60, 100} | Yes |
| erodi | Soil erodibility class | ordinal | [|0, 5|] | Yes |
| oc_top | Topsoil organic carbon content | ordinal | {1, 2, 4, 8} | Yes |
| pd_top | Topsoil packing density | ordinal | {1, 2} | Yes |
| text | Dominant surface textural class | ordinal | [|0, 5|] | Yes |
| clc | Ground occupation | categ. | [|1, 48|] | Yes |
bool. = Boolean data; categ. = categorical data; CV = coefficient of variation of monthly precipitation; quanti. = quantitative data.
Data presented in curly brackets ({ }) contain the list of all possibles values of the variable, i.e., a discrete ensemble; square brackets ([ ]) indicate the continuous range of values that can take the variable, i.e., a continuous interval; vertical lines indicate the range of integers between the two bounds given, i.e., a discrete interval.
Figure 3Predictive effectiveness of the species distribution models trained on Pl@ntNet data as a function of the confidence threshold value pmin(kmax|x) showing accuracy on the 10% densest quadrats (A) and true skill statistics (TSS; conversion of prediction value into presence/absence with the threshold that maximizes TSS) (B).
Figure 4Maps of species distribution models computed from Pl@ntNet data (based on pmin(kmax|x) = 70%) and of expert count data from the Inventaire National du Patrimoine Naturel (INPN). The sensibility and specificity used for the computation of the true skill statistics (for pmin(kmax|x) = 70) is provided for each species.
Figure 5Pl@ntNet observations with a species prediction score of more than 70% for plants living in natural conditions or cultivated for ornamental purpose.