| Literature DB >> 33968496 |
Caleb Powell1, Alaina Krakowiak1, Rachel Fuller1, Erica Rylander1, Emily Gillespie2, Shawn Krosnick3, Brad Ruhfel4, Ashley B Morris5, Joey Shaw1.
Abstract
PREMISE: Herbaria are invaluable sources for understanding the natural world, and in recent years there has been a concerted effort to digitize these collections. To organize such efforts, a method for estimating the necessary labor is desired. This work analyzes digitization productivity reports of 105 participants from eight herbaria, deriving generalized labor estimates that account for human experience. METHODS ANDEntities:
Keywords: biodiversity data; digitization rates; herbaria; natural history collections
Year: 2021 PMID: 33968496 PMCID: PMC8085955 DOI: 10.1002/aps3.11415
Source DB: PubMed Journal: Appl Plant Sci ISSN: 2168-0450 Impact factor: 1.936
The collections affiliated with this study and their contributions. Factors that may contribute to unequal specimen counts are: incomplete reporting, data cleaning, or workflow differences such as existing progress from previous efforts.
| Collection | Collection code | Participants | Total hours | Barcode application (Specimens) | Imaging (Specimens) | Skeletal data entry (Specimens) |
|---|---|---|---|---|---|---|
| Berea College, Ralph L. Thompson Herbarium | BEREA | 7 | 237 | 51 | 22,754 | 22,105 |
| East Tennessee State University | ETSU | 7 | 127 | 9861 | 9841 | 4355 |
| Middle Tennessee State University | MTSU | 17 | 875 | 3348 | 19,438 | 73,639 |
| Rhodes College | SWMT | 5 | 174 | 4881 | 5473 | 15,789 |
| Tennessee Technological University | HTTU | 25 | 520 | 18,053 | 12,658 | 15,950 |
| University of Tennessee at Chattanooga | UCHT | 42 | 1004 | 51,351 | 39,997 | 34,478 |
| University of Tennessee, Knoxville | TENN | 36 | 2960 | 174,118 | 195,797 | 175,966 |
| University of Tennessee at Martin | UTM | 2 | 49 | 1810 | 2150 | 2736 |
| University of the South | UOS | 3 | 8 | 525 | 423 | 661 |
Labor estimates for digitization tasks for multiple specimen counts across multiple contract duration hours.
| Contract duration (hours) | Task name | Specimen count | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 10,000 | 20,000 | 30,000 | 40,000 | 50,000 | 75,000 | 100,000 | 125,000 | 150,000 | 200,000 | 250,000 | 300,000 | 500,000 | ||
| 15 | Barcoding | 39 | 77 | 115 | 153 | 191 | 287 | 382 | 477 | 573 | 763 | 954 | 1145 | 1907 |
| Databasing | 61 | 121 | 181 | 241 | 301 | 451 | 601 | 751 | 901 | 1201 | 1501 | 1801 | 3001 | |
| Imaging | 79 | 158 | 236 | 315 | 394 | 590 | 787 | 983 | 1179 | 1572 | 1965 | 2358 | 3929 | |
| Combined | 179 | 356 | 532 | 709 | 886 | 1328 | 1770 | 2211 | 2653 | 3536 | 4420 | 5304 | 8837 | |
| 30 | Barcoding | 37 | 72 | 108 | 143 | 178 | 266 | 355 | 444 | 532 | 709 | 886 | 1063 | 1770 |
| Databasing | 57 | 113 | 169 | 225 | 281 | 419 | 560 | 699 | 838 | 1118 | 1398 | 1676 | 2793 | |
| Imaging | 75 | 147 | 221 | 294 | 367 | 550 | 733 | 915 | 1098 | 1463 | 1828 | 2194 | 3656 | |
| Combined | 169 | 332 | 498 | 662 | 826 | 1235 | 1648 | 2058 | 2468 | 3290 | 4112 | 4933 | 8219 | |
| 45 | Barcoding | 35 | 71 | 106 | 141 | 175 | 262 | 349 | 436 | 524 | 698 | 873 | 1047 | 1743 |
| Databasing | 54 | 107 | 159 | 211 | 263 | 394 | 525 | 655 | 786 | 1047 | 1306 | 1568 | 2612 | |
| Imaging | 71 | 138 | 208 | 275 | 344 | 515 | 685 | 855 | 1027 | 1369 | 1709 | 2053 | 3418 | |
| Combined | 160 | 316 | 473 | 627 | 782 | 1171 | 1559 | 1946 | 2337 | 3114 | 3888 | 4668 | 7773 | |
| 60 | Barcoding | 35 | 74 | 107 | 145 | 182 | 272 | 364 | 453 | 545 | 726 | 908 | 1089 | 1814 |
| Databasing | 51 | 102 | 151 | 200 | 247 | 370 | 493 | 616 | 739 | 985 | 1230 | 1475 | 2454 | |
| Imaging | 66 | 131 | 196 | 260 | 325 | 482 | 645 | 806 | 964 | 1287 | 1607 | 1927 | 3212 | |
| Combined | 152 | 307 | 454 | 605 | 754 | 1124 | 1502 | 1875 | 2248 | 2998 | 3745 | 4491 | 7480 | |
| 90 | Barcoding | 35 | 80 | 119 | 159 | 203 | 302 | 403 | 504 | 609 | 819 | 1020 | 1221 | 2040 |
| Databasing | 51 | 90 | 141 | 180 | 231 | 341 | 450 | 567 | 680 | 899 | 1129 | 1348 | 2246 | |
| Imaging | 64 | 124 | 177 | 240 | 300 | 442 | 593 | 739 | 884 | 1176 | 1474 | 1767 | 2941 | |
| Combined | 150 | 294 | 437 | 579 | 734 | 1085 | 1446 | 1810 | 2173 | 2894 | 3623 | 4336 | 7227 | |
| 135 | Barcoding | 35 | 80 | 135 | 170 | 214 | 323 | 440 | 558 | 673 | 887 | 1114 | 1346 | 2233 |
| Databasing | 51 | 90 | 129 | 179 | 218 | 327 | 432 | 532 | 641 | 858 | 1064 | 1282 | 2127 | |
| Imaging | 64 | 114 | 175 | 227 | 281 | 422 | 561 | 701 | 840 | 1119 | 1397 | 1674 | 2782 | |
| Combined | 150 | 284 | 439 | 576 | 713 | 1072 | 1433 | 1791 | 2154 | 2864 | 3575 | 4302 | 7142 | |
Figure 1The average technician skeletal databasing rate (specimen/minute) as a function of cumulative hours performing skeletal databasing. The mean rate at each two‐hour bin is indicated by the blue point, and the number of data points informing the mean is annotated over each point. The range of values at each bin are indicated by vertical bars.
Figure 2The average technician imaging rate (specimen/minute) as a function of cumulative hours imaging. The mean rate at each two‐hour bin is indicated by the blue point, and the number of data points informing the mean is annotated over each point. The range of values at each bin are indicated by vertical bars.
Figure 3The average technician barcode application rate (specimen/minute) as a polynomial function of cumulative hours applying barcodes. The mean rate at each two‐hour bin is indicated by the blue point, and the number of data points informing the mean is annotated over each point. The range of values at each bin are indicated by vertical bars.