| Literature DB >> 25685027 |
Jonathan Silvertown1, Martin Harvey2, Richard Greenwood3, Mike Dodd2, Jon Rosewell4, Tony Rebelo5, Janice Ansine2, Kevin McConway4.
Abstract
Accurate species identification is fundamental to biodiversity science, but the natural history skills required for this are neglected in formal education at all levels. In this paper we describe how the web application ispotnature.org and its sister site ispot.org.za (collectively, "iSpot") are helping to solve this problem by combining learning technology with crowdsourcing to connect beginners with experts. Over 94% of observations submitted to iSpot receive a determination. External checking of a sample of 3,287 iSpot records verified > 92% of them. To mid 2014, iSpot crowdsourced the identification of 30,000 taxa (>80% at species level) in > 390,000 observations with a global community numbering > 42,000 registered participants. More than half the observations on ispotnature.org were named within an hour of submission. iSpot uses a unique, 9-dimensional reputation system to motivate and reward participants and to verify determinations. Taxon-specific reputation points are earned when a participant proposes an identification that achieves agreement from other participants, weighted by the agreers' own reputation scores for the taxon. This system is able to discriminate effectively between competing determinations when two or more are proposed for the same observation. In 57% of such cases the reputation system improved the accuracy of the determination, while in the remainder it either improved precision (e.g. by adding a species name to a genus) or revealed false precision, for example where a determination to species level was not supported by the available evidence. We propose that the success of iSpot arises from the structure of its social network that efficiently connects beginners and experts, overcoming the social as well as geographic barriers that normally separate the two.Entities:
Keywords: Biodiversity; Citizen Science; Crowdsourcing; Identification; Learning; Learning design; social networking
Year: 2015 PMID: 25685027 PMCID: PMC4319112 DOI: 10.3897/zookeys.480.8803
Source DB: PubMed Journal: Zookeys ISSN: 1313-2970 Impact factor: 1.546
Schematic of three network structures, from equal status, to expert status and a hybrid system. (Icons http://www.icons-land.com)
The conceptual social network structure of iSpot, showing the group (3 of the 8) compartmentalization and its interaction. Not shown is the learner-mentor interaction within each group. (Icons http://www.icons-land.com)
Figure 3.The network linking participants who posted observations to iSpotnature.org without an identification and those providing a likely identification for those observations. The diagram is based on the sample of 5,000 identifications made up until 1 July 2014. This activity occurred over 32 days. The network contains 1,110 nodes linked by 2,876 edges. Red nodes (83.23%) are participants who only received identifications, green nodes (6.38%) are participants who only made identifications and blue nodes (10.39%) are participants who both made and received identifications.
Figure 4.A map of the social network of a single participant who contributed about 500 observations to iSpotnature.org at the locations shown by blue dots, with other iSpot participants, living at locations shown by red stars, who provided determinations for those observations. The location of the observation and the location of the person identifying it are joined by a line.
Figure 5.An example of an iSpot participant’s profile showing their reputation and activity in the 8 groups.
Figure 6.Schematic showing how the iSpot reputation system works. See the text for further explanation.
The characteristics of the iSpot reputation system evaluated against the requirements for online reputation systems proposed by Vavilis et al. (Vavilis et al. 2014). The characteristics shown apply to the 8-dimensions (groups) of the taxonomic reputation, not social reputation scores.
| ID | Requirement | iSpot reputation system |
|---|---|---|
| R1 | Ratings should discriminate user behaviour. | Ratings are algorithmically awarded based upon weighted agreements and discriminate between user activity in different taxa (groups). |
| R2 | Reputation should discriminate user behaviour. | Reputation is a scaled function of ratings and hence discriminates user behaviour. |
| R3 | The reputation system should be able to discriminate “incorrect” ratings. | The reputation system only awards a Likely ID when its threshold is crossed and it selects better supported over less well supported names when more than one is proposed. |
| R4 | An entity should not be able to provide rating for itself. | Users proposing a name cannot ‘agree’ with themselves. |
| R5 | Aggregation of ratings should be meaningful. | Multiple agreements are the norm and represent the ratings of different users. There is no incentive for gratuitous agreement. |
| R6 | Reputation should be assessed using a sufficient amount of information. | Reputation is earned from all agreements and is cumulative within groups. |
| R7 | The reputation system should differentiate reputation information by the interaction it represents. | The 8 dimensions of the reputation system differentiate users by their taxonomic field of expertise. Social points are earned and aggregated separately from ID-based reputations. |
| R8 | Reputation should capture the evolution of user behavior. | Reputation, earned by contributing correct identifications is dynamic, but it can only increase. |
| R9 | Users should not gain advantage of their new status. | This requirement is contrary to the learning principle utilized in iSpot, which is that new users need to be encouraged with early rewards. In iSpot, it is easier to earn the first one or two reputation badges in a taxon than later ones. However, although it is easier to gain early status, relative ranks of users can only be changed by making valued contributions. |
| R10 | New users should not be penalized for their status. | New users are encouraged (See R9). |
| R11 | Users should not be able to directly modify ratings. | Users cannot directly modify their ratings. |
| R12 | Users should not be able to directly modify reputation values. | Users cannot directly modify their reputation. |
| R13 | Users should not be responsible for directly calculating their own reputation | Reputation is calculated algorithmically, not by users themselves. |
Cumulative usage statistics up until 30 June 2014 for the two iSpot platforms www.ispotnature.org (mainly UK observations, launched June 2009) and iSpot Southern Africa www.ispot.org.za (launched June 2012). Data available from the Dryad Digital Repository: http://doi.org/10.5061/dryad.r0005
| Number of | UK | % | ZA | % | Total | % |
|---|---|---|---|---|---|---|
| Unique visitors | 724,272 | - | 166,936 | - | 891,208 | - |
| Participants registered | 35,988 | - | 6,455 | - | 42,443 | - |
| Observations uploaded | 262,942 | - | 127,222 | - | 390,164 | - |
| Determinations made | 322,076 | - | 159,675 | - | 481,751 | - |
| Agreements given | 1,020,539 | - | 241,431 | - | 1,261,970 | - |
| Images uploaded | 423,501 | - | 287,776 | - | 711,277 | - |
| Registered participants adding an observation | 13,644 | 37.65% | 1,895 | 29% | 15,539 | 37% |
| Registered participants adding a determination | 8,408 | 23.20% | 1,555 | 24% | 9,963 | 23% |
| Registered participants adding an agreement | 5,870 | 16.20% | 1,087 | 17% | 6,957 | 16% |
Since June 2013 only.
Figure 7.a iSpot participants who made at least one observation, ranked on the horizontal axis by the number of observations each made (shown on the vertical axis). b iSpot participants who made at least one determination ranked on the horizontal axis by the number of identifications each made. n = 201,711 observations on ispotnature.org for both.
Figure 8.The global distribution of observations made on ispotnature.org and ispot.org.za up to December 2013.
Figure 9.For ispotnature.org in the 4-year period shown: a The percentage of observations submitted each month that received a likely ID and the percentage where an expert provided or agreed the determination b The number of observations submitted per month.
Figure 10.The cumulative frequency distribution for the time taken for a likely ID to be acquired by observations submitted to ispotnature.org without an organism name. n = 100,703 observations.
Taxonomic ranks of Likely IDs and of alternative names for a sample of 14,611 observations of UK species submitted to iSpot.
| Rank of alternative name | |||||||
|---|---|---|---|---|---|---|---|
| Rank of Likely ID | Phylum | Class | Order | Family | Genus | Species | Total |
| Phylum | 1 | 1 | - | 2 | 1 | 8 | 13 |
| Class | 2 | 3 | 1 | 6 | 12 | 51 | 75 |
| Order | - | 5 | 14 | 29 | 40 | 118 | 206 |
| Family | 4 | 10 | 68 | 86 | 128 | 441 | 737 |
| Genus | 4 | 18 | 118 | 200 | 295 | 1,807 | 2,442 |
| Species | 26 | 113 | 388 | 737 | 1,985 | 7,889 | 11,138 |
| Total | 37 | 150 | 589 | 1,060 | 2,461 | 10,314 | 14,611 |
Percentage of changes in taxonomic rank for the data shown in Table 3, with the inferred results for precision and accuracy.
| Initial taxonomic rank | |||||||
|---|---|---|---|---|---|---|---|
| Change in taxonomic rank (Inference) | Phylum | Class | Order | Family | Genus | Species | All ranks |
| Up (False precision in the alternative name) | - | 1% | 0% | 3% | 7% | 24% | 18% |
| None (Improved accuracy in the Likely ID) | 3% | 2% | 2% | 8% | 12% | 76% | 57% |
| Down (Improved precision in the Likely ID) | 97% | 97% | 97% | 88% | 81% | - | 25% |