| Literature DB >> 27932915 |
Gordon Paterson1, Sara Albuquerque1, Vladimir Blagoderov1, Stephen Brooks1, Steve Cafferty1, Elisa Cane1, Victoria Carter1, John Chainey1, Robyn Crowther1, Lyndsey Douglas1, Joanna Durant1, Liz Duffell1, Adrian Hine1, Martin Honey1, Blanca Huertas1, Theresa Howard1, Rob Huxley1, Ian Kitching1, Sophie Ledger1, Caitlin McLaughlin1, Geoff Martin1, Gerardo Mazzetta1, Malcolm Penn1, Jasmin Perera1, Mike Sadka1, Elisabetta Scialabba1, Angela Self1, Darrell J Siebert1, Chris Sleep1, Flavia Toloni1, Peter Wing1.
Abstract
BACKGROUND: The Natural History Museum, London (NHMUK) has embarked on an ambitious programme to digitise its collections . The first phase of this programme has been to undertake a series of pilot projects that will develop the necessary workflows and infrastructure development needed to support mass digitisation of very large scientific collections. This paper presents the results of one of the pilot projects - iCollections. This project digitised all the lepidopteran specimens usually considered as butterflies, 181,545 specimens representing 89 species from the British Isles and Ireland. The data digitised includes, species name, georeferenced location, collector and collection date - the what, where, who and when of specimen data. In addition, a digital image of each specimen was taken. This paper explains the way the data were obtained and the background to the collections which made up the project. NEW INFORMATION: Specimen-level data associated with British and Irish butterfly specimens have not been available before and the iCollections project has released this valuable resource through the NHM data portal.Entities:
Keywords: British Isles; Collection digitisation; Ireland; butterflies; collections; mass digitsation
Year: 2016 PMID: 27932915 PMCID: PMC5136670 DOI: 10.3897/BDJ.4.e9559
Source DB: PubMed Journal: Biodivers Data J ISSN: 1314-2828
Figure 1.The work packages associated with the iCollections project
Figure 2.Image of a specimen in a unit tray ready to be imaged as part of the digitisation process.
Figure 3.Workflow associated with the iCollections digitisation project.
Figure 4.The label problem.
a) and b) Transcription of ambiguous dates as exemplified by two handwritten labels from the locality (town) of Ruislip pertaining to different specimens; the middle number (month) on the date from the left label could be interpreted as February or November, depending on whether the number was written as a Roman numeral or not. The label on the right (b) has been handwritten by the same person and shows unequivocally that the middle number is indeed a Roman numeral but this finding also suggests that the collector also wrote 11 in the other label. c), d), e) and f) Transcription of emergence and collection dates. (c) Collection and emergence dates clearly stated on the handwritten label. (d) Emergence date inferred by known symbols (e.g., crosses and asterisks) and abbreviations or initials (e.g., ‘B.’ standing for ‘Bred’, ‘l.’ standing for ‘larva’), (e) ‘o’ standing for ‘ova’, or (f) ‘R.’ standing for ‘reared’. g) and h) Transcription of faded information from (g) barely legible label produced with a mimeograph; (h) A similar type of label in a better conservation state. i) and j) Transcription of a barely legible, smudged number (corresponding to the year that the specimen was collected) from a printed label was easily performed by zooming in its corresponding image, as shown in the enlarged detail on the right. It became apparent that the stamped number was 96, and consequently the collection year was assumed to be 1896 (century inferred based on known period of activity for W.M. Reid).
Figure 5.Erroneous labels
Figure 6.Label transcription interface.
Figure 7.Taxon standardisation inferface.
Figure 8.Georeferencing site variant harmonisation interface.
Figure 9.Georeferencing interface
Figure 10.Map giving the specimen distribution within the collections. a- Home Counties around London; b – New Forest; c – Torbay region and d) Lake District. The latter two localities became popular holiday destinations during the late 1800s and early 1900s.
Figure 11.Chart showing the percentage of specimens belonging to the different families within the British and Irish collection.
Table 1. Species in the NHMUK British and Irish butterfly collections together with counts of the numbers of specimens of each species.
|
|
|
|
|
| 56.002 | 1 | ||
| 56.003 | 810 | ||
| 56.0031 | 1 | UK data | |
| 57.001 | 1215 | ||
| 57.0012 | 3 | UK data | |
| 57.002 | 1673 | ||
| 57.0021 | 1 | UK data | |
| 57.004 | 950 | ||
| 57.005 | 854 | ||
| 57.006 | 1250 | ||
| 57.007 | 711 | ||
| 57.008 | 875 | ||
| 57.009 | 1249 | ||
| 58.001 | 1763 | Irish specimens not dissected. Recently split into two species the collection has yet to be reidentified. | |
| 58.002 | 4 | ||
| 58.003 | 2866 | ||
| 58.005 | 451 | ||
| 58.006 | 2281 | ||
| 58.007 | 2413 | ||
| 58.008 | 8164 | ||
| 58.009 | 160 | ||
| 58.01 | 2749 | ||
| 58.011 | 662 | ||
| 58.012 | 65 | ||
| 58.0121 | 1 | UK data | |
| 58.013 | 884 | ||
| 58.014 | 4 | 4 UK data | |
| 59.001 | 24 | ||
| 59.002 | 1907 | ||
| 59.003 | 3037 | ||
| 59.004 | 5293 | ||
| 59.005 | 4008 | ||
| 59.006 | 5 | 3 UK data, 2 No data | |
| 59.007 | 1422 | ||
| 59.008 | 1734 | ||
| 59.009 | 3686 | ||
| 59.01 | 7118 | ||
| 59.011 | 3016 | ||
| 59.012 | 2706 | ||
| 59.013 | 2878 | ||
| 59.0131 | 1 | UK data | |
| 59.0132 | 1 | UK data | |
| 59.0137 | 1 | UK data | |
| 59.0138 | 1 | UK data | |
| 59.014 | 2697 | ||
| 59.015 | 3097 | ||
| 59.0151 | 1 | No data | |
| 59.016 | 119 | ||
| 59.017 | 2164 | ||
| 59.019 | 1882 | ||
| 59.02 | 1463 | ||
| 59.0201 | 4 | 3 UK, 1 no data; Agassiz number B36 | |
| 59.021 | 1169 | ||
| 59.022 | 566 | ||
| 59.0221 | 1 | UK data | |
| 59.0221 | 1 | UK data | |
| 59.023 | 1268 | ||
| 59.024 | 957 | ||
| 59.025 | 2 | ||
| 59.0251 | 1 | UK data | |
| 59.0252 | 1 | UK data | |
| 59.026 | 1280 | ||
| 59.027 | 3604 | ||
| 59.028 | 134 | ||
| 59.029 | 776 | ||
| 59.03 | 1 | ||
| 59.031 | 2044 | ||
| 59.032 | 10 | UK data | |
| 59.0321 | 1 | UK data | |
| 59.033 | 6423 | ||
| 59.034 | 1870 | ||
| 59.036 | 2253 | ||
| 60.001 | 1710 | ||
| 61.001 | 5996 | ||
| 61.002 | 1098 | 345 UK | |
| 61.0021 | 7 | 6 No data, 1 UK data; Agassiz number B40 | |
| 61.0022 | 8 | 3 UK, 5 no data; Agassiz number B39 | |
| 61.003 | 1199 | ||
| 61.004 | 1197 | ||
| 61.005 | 1425 | ||
| 61.006 | 990 | ||
| 61.007 | 893 | ||
| 61.0071 | 1 | UK data, (2 specimens in original list) | |
| 61.0071 | 2 | UK data | |
| 61.008 | 34 | ||
| 61.0081 | 3 | ||
| 61.01 | 1937 | ||
| 61.011 | 4 | 2 No data, 2 UK | |
| 61.012 | 2298 | ||
| 61.013 | 1827 | ||
| 61.01305 | 1 | UK data | |
| 61.0131 | 2 | 1 UK data, 1 No data | |
| 61.014 | 9094 | ||
| 61.0141 | 124 | No locality data hence specimens could not be assigned to either | |
| 61.015 | 2512 | ||
| 61.016 | 1836 | ||
| 61.0161 |
| 1637 | Bred hybrids |
| 61.017 | 229 | ||
| 61.018 | 12495 | ||
| 61.0181 | 2 | 2 UK data; Agassiz number B43 | |
| 61.019 | 8592 | ||
| 61.02 | 21702 | ||
| 61.0201 |
| 3 | Naturally occurring hybrids |
Figure 12.Growth of collections with time.
Figure 13.The graph indicates the when during the year specimens of the Orange Tip () butterfly were collected. The x- axis shows the day of the year while the y-axis indicates the number of specimens collected.
| Column label | Column description |
|---|---|
| GBIF | GBIF quality status |
| Catalogue | NHMUK catalogue number |
| Scientific name | Genus and species name |
| Author | Author name and date |
| Type status | Type status |
| Locality | Geographic locality |
| Country | Country |
| Records | Collector |
| Collection | Which NHMUK collection specimen record is from |
| Class | Taxonomic class |
| Family | Lepidopteran family |
| Genus | Genus |
| Species | Species |
| Subspecies | Subspecies |
| Project | Project title |