| Literature DB >> 32185122 |
Dakila A Ledesma1, Caleb A Powell2, Joey Shaw2, Hong Qin1.
Abstract
PREMISE: Large-scale efforts to digitize herbaria have resulted in more than 18 million publicly available Plantae images on sites such as iDigBio. The automation of image post-processing will lead to time savings in the digitization of biological specimens, as well as improvements in data quality. Here, new and modified neural network methodologies were developed to automatically detect color reference charts (CRC), enabling the future automation of various post-processing tasks. METHODS ANDEntities:
Keywords: automation; digitization; herbarium; machine learning; natural history collections; specimen images
Year: 2020 PMID: 32185122 PMCID: PMC7073326 DOI: 10.1002/aps3.11331
Source DB: PubMed Journal: Appl Plant Sci ISSN: 2168-0450 Impact factor: 1.936
Figure 1Example herbarium sheet image containing a small color reference chart (CRC) and its evaluation using Faster R‐CNN. (A) A herbarium sheet image with a small CRC showing the difference in scale between the CRC and the total image size. (B) The same image as in A, but with moderate cropping. Faster R‐CNN was not able to find the CRC even with this cropping due to its small relative size. (C) The same image as in A, but with more cropping than in B. Faster R‐CNN was able to find the CRC with 88% confidence, but the predicted region (blue box) lacks precision and would fail our standards for a correct CRC identification.
Figure 2General ColorNet architecture. Specific details of each layer and their configuration may be found in Appendix S1. HSV: hue, saturation, value; MLP: multilayer perceptron; RGB: red, green, blue.
Figure 3Example CRC detection results (not to scale) using ColorNet (small CRCs) (A, B) and modified Faster R‐CNN (large CRCs) (C), and post‐processing high‐precision crop results using the CRC information (B). Examples of small CRC detection: (A) The output of the most probable partition to contain the CRC. From left to right are regular, skewed, and partially obstructed small CRCs found within our test data set. (B) Outputs similar to A, but with high‐precision cropping of RAW images that have not been white balanced or color corrected.
Region proposal results from each algorithm and model based on the randomly pulled 565 small CRC and 395 large CRC images from SERNEC.
| Algorithm/model | Image resolution (pixels) | Accuracy (%) | Intel Core‐i3 inference time per image (seconds) | Raspberry Pi 4 inference time per image (seconds) |
|---|---|---|---|---|
| Selective Search (Fast) | 1250 × 1875 | 100 | 39.90 | Not tested |
| Original Faster R‐CNN (small CRC) | 600 × 900 | 67.080 | 4.02 | 61.42 |
| Original Faster R‐CNN (large CRC) | 600 × 900 | 99.748 | 3.71 | 59.15 |
| ColorNet‐Quick (small CRC) | 1250 × 1875 | 95.221 | 0.59 | 2.88 |
| ColorNet‐Normal (small CRC) | 1250 × 1875 | 98.053 | 1.11 | 4.68 |
| Modified Faster R‐CNN (large CRC) | 600 × 900 | 99.241 | 1.45 | 7.08 |
From the start of the inference time to the end of inference time, not including load times of the images or processing of the image.
With the neural network resizing to proper dimensions (600 px on the shortest side) resulting in the processing of 600 × 900 px images.
Region proposal only. No classifier was used due to very poor speed. The poor speed would be compounded with a classifier assessing each individual region for a CRC.