Fiona M Jones, Campbell Allen, Carlos Arteta, Joan Arthur, Caitlin Black, Louise M Emmerson, Robin Freeman, Greg Hines, Chris J Lintott, Zuzana Macháčková, Grant Miller, Rob Simpson, Colin Southwell, Holly R Torsey, Andrew Zisserman, Tom Hart.
Abstract
Automated time-lapse cameras can facilitate reliable and consistent monitoring of wild animal populations. In this report, data from 73,802 images taken by 15 different Penguin Watch cameras are presented, capturing the dynamics of penguin (Spheniscidae; Pygoscelis spp.) breeding colonies across the Antarctic Peninsula, South Shetland Islands and South Georgia (03/2012 to 01/2014). Citizen science provides a means by which large and otherwise intractable photographic data sets can be processed, and here we describe the methodology associated with the Zooniverse project Penguin Watch, and provide validation of the method. We present anonymised volunteer classifications for the 73,802 images, alongside the associated metadata (including date/time and temperature information). In addition to the benefits for ecological monitoring, such as easy detection of animal attendance patterns, this type of annotated time-lapse imagery can be employed as a training tool for machine learning algorithms to automate data extraction, and we encourage the use of this data set for computer vision development.
Year: 2018 PMID: 29944146 PMCID: PMC6018656 DOI: 10.1038/sdata.2018.124
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Comparison of counts between Penguin Watch (CS) and GS data for four cameras: DAMOa, HALFc, LOCKb and PETEc. ‘Threshold employed’ relates to the filtering threshold applied during analysis.
For example, a threshold of ‘>2’ means that at least three people must have marked an area before it is counted as a penguin. ‘Average difference’ is the mean average of all differences between the GS and …

| Threshold employed | Adults only, >1 | Adults only, >2 | Adults only, >3 | Adults only, >4 | All, >1 | All, >2 | All, >3 | All, >4 |
|---|---|---|---|---|---|---|---|---|
| DAMOa | n=300 | | | | n=300 | | | |
| Average difference | 1.88 | 1.50 | 1.18 | 1.24 | 1.94 | 1.58 | 1.37 | 1.53 |
| σ | 2.15 | 1.78 | 1.56 | 1.40 | 2.22 | 1.86 | 1.69 | 1.61 |
| Proportion of differences that are 0 or 1 | 0.56 | 0.67 | 0.75 | 0.69 | 0.56 | 0.64 | 0.69 | 0.62 |
| HALFc | n=283 | | | | n=283 | | | |
| Average difference | 1.28 | 0.99 | 0.88 | 0.99 | 1.45 | 1.27 | 1.34 | 1.55 |
| σ | 2.06 | 1.58 | 1.34 | 1.28 | 2.08 | 1.71 | 1.64 | 1.72 |
| Proportion of differences that are 0 or 1 | 0.74 | 0.82 | 0.85 | 0.82 | 0.69 | 0.71 | 0.68 | 0.64 |
| LOCKb | n=300 | | | | n=300 | | | |
| Average difference | 1.41 | 1.25 | 1.17 | 1.19 | 1.40 | 1.24 | 1.18 | 1.29 |
| σ | 2.31 | 2.12 | 1.98 | 1.96 | 2.30 | 1.89 | 1.74 | 1.78 |
| Proportion of differences that are 0 or 1 | 0.73 | 0.77 | 0.77 | 0.75 | 0.72 | 0.73 | 0.73 | 0.68 |
| PETEc | n=300 | | | | n=300 | | | |
| Average difference | 3.60 | 2.48 | 2.36 | 3.27 | 4.29 | 3.54 | 4.37 | 6.06 |
| σ | 3.50 | 2.99 | 2.69 | 2.86 | 4.51 | 4.07 | 4.41 | 5.20 |
| Proportion of differences that are 0 or 1 | 0.28 | 0.46 | 0.46 | 0.30 | 0.25 | 0.37 | 0.33 | 0.20 |
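The three summary measures reported per camera (average difference, σ, and the proportion of differences that are 0 or 1) can be illustrated with a minimal Python sketch. It assumes that a "difference" is the absolute difference between the GS and CS count for an image, and that σ is the population standard deviation of those differences; neither detail is confirmed by the text here. The function name `compare_counts` and the counts themselves are made up for illustration.

```python
from statistics import mean, pstdev


def compare_counts(gs_counts, cs_counts):
    """Summarise per-image differences between GS and CS counts.

    Assumption: a difference is the absolute value |GS - CS|, and sigma is
    the population standard deviation of those differences.
    """
    if len(gs_counts) != len(cs_counts):
        raise ValueError("GS and CS count lists must have the same length")
    diffs = [abs(g - c) for g, c in zip(gs_counts, cs_counts)]
    return {
        "average_difference": mean(diffs),
        "sigma": pstdev(diffs),
        "proportion_0_or_1": sum(d <= 1 for d in diffs) / len(diffs),
    }


# Hypothetical counts for five images (not taken from the data set).
gs = [10, 12, 7, 0, 5]
cs = [9, 12, 10, 0, 7]
summary = compare_counts(gs, cs)
print(summary)
```

With these made-up counts the differences are 1, 0, 3, 0 and 2, so the average difference is 1.2 and three of the five images differ by 0 or 1.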
Comparison of chick counts between Penguin Watch (CS) and GS data for four cameras: DAMOa, HALFc, LOCKb and PETEc; see Table 1 legend for definitions.
The first column shows the results for all images in the sample, including those where – according to the GS and CS data – no chicks were present. The second column presents the results for the sample of images where chicks were present, according to the GS and/or CS classifications. When extracting …

| Threshold employed | Chicks only (all images), >1 | Chicks only (all images), >2 | Chicks only (all images), >3 | Chicks only (all images), >4 | Chicks only (images with chicks), >1 | Chicks only (images with chicks), >2 | Chicks only (images with chicks), >3 | Chicks only (images with chicks), >4 |
|---|---|---|---|---|---|---|---|---|
| DAMOa | n=300 | | | | n=60 | | | |
| Average difference | 0.25 | 0.27 | 0.29 | 0.33 | 1.25 | 1.33 | 1.45 | 1.65 |
| σ | 0.68 | 0.71 | 0.75 | 0.85 | 1.04 | 1.05 | 1.06 | 1.20 |
| Proportion of differences that are 0 or 1 | 0.94 | 0.93 | 0.92 | 0.91 | 0.68 | 0.67 | 0.60 | 0.53 |
| HALFc | n=283 | | | | n=109 | | | |
| Average difference | 0.46 | 0.51 | 0.58 | 0.65 | 1.18 | 1.31 | 1.50 | 1.69 |
| σ | 0.89 | 0.99 | 1.11 | 1.24 | 1.09 | 1.22 | 1.35 | 1.50 |
| Proportion of differences that are 0 or 1 | 0.90 | 0.87 | 0.83 | 0.82 | 0.73 | 0.65 | 0.57 | 0.54 |
| LOCKb | n=300 | | | | n=122 | | | |
| Average difference | 0.94 | 0.90 | 0.88 | 0.92 | 2.32 | 2.22 | 2.16 | 2.25 |
| σ | 2.21 | 2.08 | 2.07 | 2.12 | 2.97 | 2.79 | 2.79 | 2.84 |
| Proportion of differences that are 0 or 1 | 0.81 | 0.81 | 0.83 | 0.80 | 0.53 | 0.53 | 0.59 | 0.52 |
| PETEc | n=300 | | | | n=179 | | | |
| Average difference | 1.89 | 2.07 | 2.49 | 3.06 | 3.17 | 3.46 | 4.17 | 5.12 |
| σ | 2.69 | 2.69 | 3.12 | 3.70 | 2.85 | 2.71 | 3.06 | 3.51 |
| Proportion of differences that are 0 or 1 | 0.60 | 0.55 | 0.52 | 0.49 | 0.33 | 0.25 | 0.20 | 0.15 |
Percentage of Penguin Watch (CS) classifications that are greater than (GS < CS), less than (GS > CS) or equal to (GS = CS) gold standard (GS) classifications (i.e. overestimates, underestimates or matches) for DAMOa, HALFc, LOCKb and PETEc.
Results are shown for adult classifications (all images) and chick classifications (only images where chicks are present, according to GS and/or CS data).

| Threshold employed | Adults only, >1 | Adults only, >2 | Adults only, >3 | Adults only, >4 | Chicks only, >1 | Chicks only, >2 | Chicks only, >3 | Chicks only, >4 |
|---|---|---|---|---|---|---|---|---|
| DAMOa | n=300 | | | | n=60 | | | |
| GS<CS (%) | 67.00 | 51.67 | 33.67 | 18.00 | 15.00 | 10.00 | 3.33 | 3.33 |
| GS>CS (%) | 9.33 | 22.00 | 33.33 | 51.33 | 61.67 | 71.67 | 81.67 | 86.67 |
| GS=CS (%) | 23.67 | 22.33 | 33.00 | 30.67 | 23.33 | 18.33 | 15.00 | 10.00 |
| HALFc | n=283 | | | | n=109 | | | |
| GS<CS (%) | 48.06 | 35.34 | 12.61 | 14.49 | 17.43 | 11.01 | 2.75 | 0.92 |
| GS>CS (%) | 12.72 | 20.14 | 32.16 | 48.76 | 57.80 | 63.30 | 73.39 | 79.82 |
| GS=CS (%) | 39.22 | 44.52 | 45.23 | 36.75 | 24.78 | 25.69 | 23.85 | 19.27 |
| LOCKb | n=300 | | | | n=122 | | | |
| GS<CS (%) | 48.67 | 37.67 | 27.33 | 22.33 | 27.05 | 23.77 | 17.21 | 9.84 |
| GS>CS (%) | 11.00 | 18.33 | 26.67 | 32.00 | 51.64 | 54.92 | 63.11 | 66.39 |
| GS=CS (%) | 40.33 | 44.00 | 46.00 | 45.67 | 21.31 | 21.31 | 19.67 | 23.77 |
| PETEc | n=300 | | | | n=179 | | | |
| GS<CS (%) | 79.33 | 55.33 | 26.33 | 13.67 | 32.96 | 20.67 | 10.06 | 6.70 |
| GS>CS (%) | 13.00 | 30.67 | 54.00 | 77.00 | 58.66 | 72.07 | 83.24 | 91.60 |
| GS=CS (%) | 7.67 | 14.00 | 19.67 | 9.33 | 8.38 | 7.26 | 6.70 | 1.68 |
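The overestimate, underestimate and match percentages can be derived from paired per-image counts in the same way. The sketch below is illustrative only: the function name `agreement_percentages` and the sample counts are hypothetical, and it assumes each image contributes one GS count and one CS count at a given threshold.

```python
def agreement_percentages(gs_counts, cs_counts):
    """Return the percentage of images where CS over-, under- or exactly
    matches the gold standard (GS<CS, GS>CS and GS=CS respectively)."""
    if len(gs_counts) != len(cs_counts) or not gs_counts:
        raise ValueError("need two non-empty lists of equal length")
    n = len(gs_counts)
    over = sum(c > g for g, c in zip(gs_counts, cs_counts))   # GS < CS
    under = sum(c < g for g, c in zip(gs_counts, cs_counts))  # GS > CS
    match = n - over - under                                  # GS = CS
    return {
        "GS<CS": 100 * over / n,
        "GS>CS": 100 * under / n,
        "GS=CS": 100 * match / n,
    }


# Hypothetical counts for five images (not taken from the data set).
gs = [10, 12, 7, 0, 5]
cs = [9, 12, 10, 0, 7]
result = agreement_percentages(gs, cs)
print(result)  # {'GS<CS': 40.0, 'GS>CS': 20.0, 'GS=CS': 40.0}
```

The three percentages are built from counts that partition the sample, so they should sum to 100 for any column, up to rounding.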
Raw Images – Metadata and File Information.
Metadata and file information associated with each of the 73,802 ‘Raw images’ included in the Dryad Digital Repository (Data Citation 1). These images are stored under the folders: DAMO to MAIV, NEKO, PETE.1, PETE.2 and SPIG. Images retain their original dimensions (either 1920×1080 pixels or 2048×1536 pixels), but have been renamed. The …

| Site (camera location) | Geographic coordinates | Camera name | Image set (name in repository) | No. of files | Time range | Camera settings |
|---|---|---|---|---|---|---|
| Damoy Point, Wiencke Island, Antarctic Peninsula | 64.82° S, 63.49° W | DAMOa | DAMOa2014a | 407 | 26/12/2013 1300–22/01/2014 1400 | 0700–2100; hourly |
| George’s Point, Rongé Island on the Errera Channel, Antarctic Peninsula | 64.67° S, 62.67° W | GEORa | GEORa2013 | 4403 | 22/12/2012 1000–24/12/2013 0800 | 0800–1900; hourly |
| Half Moon Island, South Shetland Islands | 62.60° S, 59.90° W | HALFb | HALFb2013a | 306 | 15/12/2012 1700–08/01/2013 1000 | 0700–1900; hourly |
| | | HALFc | HALFc2013a | 283 | 21/12/2012 1700–09/01/2013 1400 | 0700–2100; hourly |
| Port Lockroy, Wiencke Island, Antarctic Peninsula | 64.82° S, 63.49° W | LOCKb | LOCKb2013 | 2372 | 13/12/2012 1100–31/05/2013 1600 | 0700–2000; hourly |
| Maiviken, South Georgia | 54.24° S, 36.50° W | MAIVb | MAIVb2012a | 660 | 13/10/2012 0900–03/01/2013 1300 | 0900–1100, 1300–1700; hourly |
| | | | MAIVb2013a | 3878 | 03/01/2013 1400–28/10/2013 1700 | 0800–2000; hourly |
| | | | MAIVb2013c | 137 | 29/10/2013 0800–08/11/2013 1400 | 0800–2000; hourly |
| | | MAIVc | MAIVc2013 | 4661 | 15/10/2012 0900–08/11/2013 1300 | 0900–1100, 1300–1700; hourly |
| Neko Harbour, Andvord Bay, Antarctic Peninsula | 64.86° S, 62.52° W | NEKOa | NEKOa2012a | 1608 | 03/03/2012 1000–25/11/2012 1500 | 1000–1500; hourly |
| | | | NEKOa2013 | 3552 | 26/11/2012 1000–25/12/2013 1500 | 0800–1600; hourly |
| | | | NEKOa2014a | 239 | 25/12/2013 1600–13/01/2014 0700 | 0700–1900; hourly |
| | | NEKOb | NEKOb2013 | 6449 | 02/03/2012 1600–26/11/2012 0800 | 0000–1200; hourly |
| | | NEKOc | NEKOc2013 | 4415 | 14/12/2012 1600–19/11/2013 1000 | 0700–1900; hourly |
| | | | NEKOc2014b | 296 | 25/12/2013 1600–13/01/2014 0700 | 0600–2100; hourly |
| Petermann Island, Antarctic Peninsula | 65.17° S, 64.14° W | PETEc | PETEc2013 | 506 | 03/12/2012 1600–11/01/2013 1400 | 0800–1100, 1300–2100; hourly |
| | | | PETEc2014 | 10,732 | 11/01/2013 1430–04/01/2014 1000 | 0700–2130; half hourly |
| | | PETEd | PETEd2013 | 10,732 | 11/01/2013 1430–04/01/2014 1000 | 0700–2130; half hourly |
| | | PETEe | PETEe2013 | 3874 | 12/12/2012 1600–04/01/2014 0900 | 0900–1800; hourly |
| | | PETEf | PETEf2014a | 8271 | 24/12/2012 1300–04/01/2014 1000 | 0100–1100, 1300–2300; hourly |
| Spigot Peak, Orne Harbour, Antarctic Peninsula | 64.63° S, 62.57° W | SPIGa | SPIGa2012a | 390 | 25/11/2012 1700–21/12/2012 1600 | 0700–2100; hourly |
| | | | SPIGa2013b | 5221 | 09/01/2013 1500–23/12/2013 1500 | 0700–2100; hourly |
| | | | SPIGa2014 | 410 | 23/12/2013 1600–21/01/2014 2000 | 0700–2000; hourly |
| Total | | | | 73,802 | | |
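As a quick consistency check, the per-image-set file counts listed in the table can be summed and compared with the stated total of 73,802 images from 15 cameras. The sketch below copies the counts from the table; taking the first five characters of an image-set name as the camera name is an assumption based on the naming pattern visible in the table, not a documented rule.

```python
# File counts per image set, copied from the table above.
FILES_PER_IMAGE_SET = {
    "DAMOa2014a": 407, "GEORa2013": 4403, "HALFb2013a": 306,
    "HALFc2013a": 283, "LOCKb2013": 2372, "MAIVb2012a": 660,
    "MAIVb2013a": 3878, "MAIVb2013c": 137, "MAIVc2013": 4661,
    "NEKOa2012a": 1608, "NEKOa2013": 3552, "NEKOa2014a": 239,
    "NEKOb2013": 6449, "NEKOc2013": 4415, "NEKOc2014b": 296,
    "PETEc2013": 506, "PETEc2014": 10732, "PETEd2013": 10732,
    "PETEe2013": 3874, "PETEf2014a": 8271, "SPIGa2012a": 390,
    "SPIGa2013b": 5221, "SPIGa2014": 410,
}

total_files = sum(FILES_PER_IMAGE_SET.values())

# Assumption: the camera name is the first five characters of the set name.
cameras = sorted({name[:5] for name in FILES_PER_IMAGE_SET})

print(total_files)   # 73802
print(len(cameras))  # 15
```

Both figures agree with the totals quoted in the abstract.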
Figure 1. Two examples of remote camera structures.
(a) A wire rock basket, covered by further rocks, is used to support the metal scaffold pole (Orne Harbour, Antarctic Peninsula); (b) multiple metal “legs” are fastened to the main structure for support; each “foot” is secured using rocks (Aitcho Islands, South Shetland Islands, Antarctic Peninsula). The design shown in (b) is favoured for future constructions as the “legs” provide increased stability and are longer-lasting than the wire used in (a), which becomes brittle after approximately three years. The cameras shown here are powered by internal batteries. Photo credit: FMJ.
Figure 2. The Penguin Watch volunteer work flow.
If animals are present in a given image, volunteers are asked to tag individuals by clicking on them, and classify them as ‘adult’, ‘chick’, ‘egg’, or ‘other’ (the latter can be used to identify other fauna, ships or humans). Once an image has been classified, volunteers are given the opportunity to ‘talk’ about it on a Penguin Watch forum. Green boxes indicate that volunteers must supply an answer; purple boxes indicate that a process must be carried out (such as clicking on penguins). Image source: https://www.zooniverse.org/lab.
Figure 3. Image HALFb2013a_000051.JPG, with the ‘raw clicks’ of Penguin Watch volunteers overlaid.
Each dot represents a single click, with colours specific to ten individual volunteers (images said to contain animals are shown to ten volunteers by default). Using the clustering algorithm (see ‘Code Availability’), ‘consensus clicks’ are derived from each group of markings. The coordinates of ‘raw clicks’ and ‘consensus clicks’ can be found in the Dryad Digital Repository (Tables 5 and 6; Data Citation 1).
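To make the step from ‘raw clicks’ to ‘consensus clicks’ concrete, here is a minimal Python sketch. It is not the project's clustering algorithm (see ‘Code Availability’ for that). It swaps in a simple greedy, distance-based grouping as a stand-in, then applies a volunteer-count threshold of the kind used in the validation tables, where ‘>2’ means at least three people marked the area. The function name, the `radius` parameter and the click coordinates are all hypothetical.

```python
from math import hypot


def consensus_clicks(raw_clicks, radius=20.0, threshold=2):
    """Group raw clicks and keep groups marked by more than `threshold` volunteers.

    raw_clicks: iterable of (volunteer_id, x, y) tuples.
    Returns a list of (mean_x, mean_y, n_volunteers) tuples.
    Stand-in method: each click joins the first group whose current centroid
    lies within `radius` pixels, otherwise it starts a new group.
    """
    clusters = []  # each cluster: {"points": [(x, y), ...], "volunteers": set()}
    for volunteer, x, y in raw_clicks:
        for cluster in clusters:
            pts = cluster["points"]
            cx = sum(p[0] for p in pts) / len(pts)
            cy = sum(p[1] for p in pts) / len(pts)
            if hypot(x - cx, y - cy) <= radius:
                pts.append((x, y))
                cluster["volunteers"].add(volunteer)
                break
        else:
            clusters.append({"points": [(x, y)], "volunteers": {volunteer}})

    consensus = []
    for cluster in clusters:
        n_volunteers = len(cluster["volunteers"])
        if n_volunteers > threshold:  # '>2' keeps areas marked by 3 or more people
            pts = cluster["points"]
            mean_x = sum(p[0] for p in pts) / len(pts)
            mean_y = sum(p[1] for p in pts) / len(pts)
            consensus.append((mean_x, mean_y, n_volunteers))
    return consensus


# Hypothetical clicks: three volunteers agree on one spot, two on another.
clicks = [
    ("v1", 100, 100), ("v2", 102, 98), ("v3", 98, 102),
    ("v1", 300, 300), ("v2", 303, 299),
]
result = consensus_clicks(clicks, radius=20.0, threshold=2)
print(result)  # [(100.0, 100.0, 3)]
```

Counting distinct volunteers rather than clicks means one person clicking the same spot repeatedly cannot push a group over the threshold.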
Penguin Watch Consensus Click Data.
List of ‘PW Anonymised Raw Classifications and Metadata’ files stored in the Dryad Digital Repository (Data Citation 1), and their associated cameras/image files. The number of unique …

| Camera name | Corresponding image set | File name (in repository) |
|---|---|---|
| DAMOa | DAMOa2014a | DAMOa2014a_concl.csv |
| GEORa | GEORa2013 | GEORa2013a_concl.csv |
| | | GEORa2013b_concl.csv |
| HALFb | HALFb2013a | HALFb2013a_concl.csv |
| HALFc | HALFc2013a | HALFc2013a_concl.csv |
| LOCKb | LOCKb2013 | LOCKb2013a_concl.csv |
| | | LOCKb2013b_concl.csv |
| MAIVb | MAIVb2012a | MAIVb2012a_concl.csv |
| | MAIVb2013a | MAIVb2013a_concl.csv |
| | MAIVb2013c | MAIVb2013c_concl.csv |
| MAIVc | MAIVc2013 | MAIVc2013_concl.csv |
| NEKOa | NEKOa2012a | NEKOa2012a_concl.csv |
| | NEKOa2013 | NEKOa2013a_concl.csv |
| | | NEKOa2013b_concl.csv |
| | | NEKOa2013c_concl.csv |
| | NEKOa2014a | NEKOa2014a_concl.csv |
| NEKOb | NEKOb2013 | NEKOb2013_concl.csv |
| NEKOc | NEKOc2013 | NEKOc2013a_concl.csv |
| | | NEKOc2013b_concl.csv |
| | | NEKOc2013c_concl.csv |
| | NEKOc2014b | NEKOc2014b_concl.csv |
| PETEc | PETEc2013 | PETEc2013a_concl.csv |
| | | PETEc2013b_concl.csv |
| | PETEc2014 | PETEc2014a_concl.csv |
| | | PETEc2014b_concl.csv |
| PETEd | PETEd2013 | PETEd2013a_concl.csv |
| | | PETEd2013b_concl.csv |
| PETEe | PETEe2013 | PETEe2013a_concl.csv |
| | | PETEe2013b_concl.csv |
| PETEf | PETEf2014a | PETEf2014a_concl.csv |
| SPIGa | SPIGa2012a | SPIGa2012a_concl.csv |
| | SPIGa2013b | SPIGa2013b_concl.csv |
| | SPIGa2014 | SPIGa2014a_concl.csv |
| | | SPIGa2014b_concl.csv |
Figure 4. A sample image (DAMOa2014a_000028.JPG), taken by a Reconyx time-lapse camera at Damoy Point, Wiencke Island, Antarctic Peninsula (64.82° S, 63.49° W).
Date, time, moon phase and temperature information is shown at the top of the image. This is one of 73,802 photographs that can be found online in the Dryad Digital Repository (Data Citation 1).
Penguin Watch Anonymised Raw Classifications and Metadata.
List of ‘PW Anonymised Raw Classifications and Metadata’ files stored in the Dryad Digital Repository (Data Citation 1), and their associated cameras/image files. The number of unique …

| Camera name | Corresponding image set | File name (in repository) | Number of unique annotators |
|---|---|---|---|
| DAMOa | DAMOa2014a | DAMOa2014a_metadata.csv | 3042 |
| GEORa | GEORa2013 | GEORa2013a_metadata.csv | 1810 |
| | | GEORa2013b_metadata.csv | 13,353 |
| HALFb | HALFb2013a | HALFb2013a_metadata.csv | 2433 |
| HALFc | HALFc2013a | HALFc2013a_metadata.csv | 2211 |
| LOCKb | LOCKb2013 | LOCKb2013a_metadata.csv | 1286 |
| | | LOCKb2013b_metadata.csv | 8959 |
| MAIVb | MAIVb2012a | MAIVb2012a_metadata.csv | 3977 |
| | MAIVb2013a | MAIVb2013a_metadata.csv | 11,367 |
| | MAIVb2013c | MAIVb2013c_metadata.csv | 1177 |
| MAIVc | MAIVc2013 | MAIVc2013_metadata.csv | 13,543 |
| NEKOa | NEKOa2012a | NEKOa2012a_metadata.csv | 6874 |
| | NEKOa2013 | NEKOa2013a_metadata.csv | 705 |
| | | NEKOa2013b_metadata.csv | 11,068 |
| | | NEKOa2013c_metadata.csv | 2411 |
| | NEKOa2014a | NEKOa2014a_metadata.csv | 1831 |
| NEKOb | NEKOb2013 | NEKOb2013_metadata.csv | 15,789 |
| NEKOc | NEKOc2013 | NEKOc2013a_metadata.csv | 28 |
| | | NEKOc2013b_metadata.csv | 1019 |
| | | NEKOc2013c_metadata.csv | 13,523 |
| | NEKOc2014b | NEKOc2014b_metadata.csv | 2212 |
| PETEc | PETEc2013 | PETEc2013a_metadata.csv | 2033 |
| | | PETEc2013b_metadata.csv | 1926 |
| | PETEc2014 | PETEc2014a_metadata.csv | 21,009 |
| | | PETEc2014b_metadata.csv | 4492 |
| PETEd | PETEd2013 | PETEd2013a_metadata.csv | 21,280 |
| | | PETEd2013b_metadata.csv | 4594 |
| PETEe | PETEe2013 | PETEe2013a_metadata.csv | 981 |
| | | PETEe2013b_metadata.csv | 12,469 |
| PETEf | PETEf2014a | PETEf2014a_metadata.csv | 18,461 |
| SPIGa | SPIGa2012a | SPIGa2012a_metadata.csv | 2419 |
| | SPIGa2013b | SPIGa2013b_metadata.csv | 12,435 |
| | SPIGa2014 | SPIGa2014a_metadata.csv | 1658 |
| | | SPIGa2014b_metadata.csv | 1235 |
Figure 5. Percentage of Penguin Watch (CS) classifications that are greater than (GS < CS), less than (GS > CS) or equal to (GS = CS) gold standard (GS) classifications (i.e. overestimates, underestimates or matches) for DAMOa.
Top: Adult classifications only (n=300); bottom: chick classifications only (for images where chicks are present according to GS and/or CS data; n=60).