| Literature DB >> 30937225 |
Titouan Lorieul1,2, Katelin D Pearson3, Elizabeth R Ellwood4, Hervé Goëau5,6, Jean-Francois Molino5, Patrick W Sweeney7, Jennifer M Yost8, Joel Sachs9, Erick Mata-Montero10, Gil Nelson11, Pamela S Soltis12, Pierre Bonnet5,6, Alexis Joly2.
Abstract
PREMISE OF THE STUDY: Phenological annotation models computed on large-scale herbarium data sets were developed and tested in this study.Entities:
Keywords: convolutional neural network; deep learning; herbarium data; natural history collections; phenological stage annotation; visual data classification
Year: 2019 PMID: 30937225 PMCID: PMC6426157 DOI: 10.1002/aps3.1233
Source DB: PubMed Journal: Appl Plant Sci ISSN: 2168-0450 Impact factor: 1.936
Description of data sets used on EXP1‐Fertility, EXP2‐Fl.Fr, and EXP3‐Pheno.a
| Full data set names | Data set acronyms | No. of herbarium specimens | Fertile proportion | Flower proportion | Fruit proportion | No. of families | No. of genera | No. of species |
|---|---|---|---|---|---|---|---|---|
| New England Vascular Plant specimens | NEVP | 42,658 | 90.9% | 64.9% | 34.9% | 16 | 340 | 1375 |
| Florida State University's Robert K. Godfrey Herbarium | FSU | 54,263 | 92.7% | 73.9% | 55.2% | 202 | 1189 | 3870 |
| IRD Herbarium of Cayenne | CAY | 66,312 | 79.4% | 46.6% | 35.1% | 126 | 764 | 3024 |
| Asteraceae phenophase data set | PHENO | 20,994 | 100% | NA | NA | 1 | 16 | 139 |
IRD = Institut de Recherche pour le Développement; NA = not available.
See Appendices 2 and 3 for the lists of institutions contributing data to the NEVP and PHENO data sets.
Figure 1Illustration of the different phenological stages of Tilia americana on the NEVP herbarium data set. (A) Non‐fertile specimen, (B) specimen with open flowers, (C) specimen with ripe fruits.
Phenophases assigned to specimens in the PHENO data set with percentages of reproductive structures on a specimen that are closed buds, flowers, and fruits
| Phenophase code | Phenophase description | Distribution |
|---|---|---|
| 1 | Specimen with unopened flowers | 100% closed buds |
| 2 | Specimen mainly in buds | 75% closed buds, 25% flowers, 0% fruits 75% closed buds, 0% flowers, 25% fruits |
| 3 | Specimen essentially in buds and flowers | 50% closed buds, 50% flowers, 0% fruits |
| 4 | Specimen mainly in buds and flowers | 50% closed buds, 25% flowers, 25% fruits 25% closed buds, 75% flowers, 0% fruits |
| 5 | Specimen mainly in flowers | 0% closed buds, 100% flowers, 0% fruits 25% closed buds, 50% flowers, 25% fruits |
| 6 | Specimen mainly in flowers and fruits | 0% closed buds, 75% flowers, 25% fruits25% closed buds, 25% flowers, 50% fruits50% closed buds, 0% flowers, 50% fruits |
| 7 | Specimen essentially in flowers and fruits | 0% closed buds, 50% flowers, 50% fruits |
| 8 | Specimen mainly in fruits | 0% closed buds, 25% flowers, 75% fruits25% closed buds, 0% flowers, 75% fruits |
| 9 | Specimen essentially in fruits | 0% closed buds, 0% flowers, 100% fruits |
Distribution of closed buds, flowers, and fruit on the specimen.
Figure 2Illustration of the nine different phenophases of Coreopsis gladiata recorded in the PHENO data set.
Data distribution and results of the fertility detection accuracy obtained in EXP1‐Fertility.a
| Evaluated models | Training set | Test set A | Test set B | Test set C |
|---|---|---|---|---|
| Data set size | 120,739 | 13,415 | 14,539 | 14,540 |
| Percentage of fertile specimens | 86.4% | 86.2% | 87.4% | 91.1% |
| ResNet50‐Large (400 × 250 pixels) | — | 94.9% | 93.6% | 93.2% |
| ResNet50‐VeryLarge (800 × 550 pixels) | — | 96.3% | 95.2% | 92.0% |
Test set A = Random‐split, test set B = Species‐split (747 species), test set C = Herbarium‐split (nine NEVP herbaria).
Default image size 900 × 600 pixels.
Distribution of angiosperms, ferns, and gymnosperms in the data sets used for experiment EXP1‐Fertility and results of the fertility detection accuracy obtained in that experiment for test set A (Random‐split)
| Evaluated clades | Data distribution | Fertility detection accuracy | ||||
|---|---|---|---|---|---|---|
| Whole data set | Training set | Test set A | Test set B | Test set C | Test set A | |
| Angiosperms | 91.47% | 90.99% | 90.49% | 87.8% | 100% | 96.3% |
| Ferns and allies | 8.51% | 8.98% | 9.47% | 12.2% | 0 | 95.7% |
| Gymnosperms | 0.02% | 0.03% | 0.04% | 0 | 0 | 100% |
Data distribution and results of the flower and fruit detection accuracy obtained in EXP2‐Fl.Fr
| Evaluated models | Training set | Test set A (Random‐split) | Test set B (Species‐split) | Test set C (Herbarium‐split) |
|---|---|---|---|---|
| Data set size | 109,467 | 12,095 | 12,723 | 14,066 |
| Percentage of specimens in flower | 60.9% | 60.6% | 62.5% | 68.5% |
| Percentage of specimens in fruit | 43.3% | 43.4% | 44.9% | 32.5% |
| ResNet50‐flowers | — | 84.3% | 81.0% | 87.0% |
| ResNet50‐fruits | — | 80.5% | 76.6% | 79.6% |
Data distribution and results of the phenophase detection accuracy obtained in EXP3‐Pheno. Coarse classification accuracy is computed based on grouping phenophase categories by 3
| Evaluated model | Training set size (No. of images) | Test set size (No. of images) | Fine‐grained classification accuracy | Coarse classification accuracy | Accuracy when tolerating a [±1] error range | Accuracy when tolerating a [±2] error range | Fine‐grained mean L1 error | Coarse mean L1 error |
|---|---|---|---|---|---|---|---|---|
| ResNet50‐Pheno | 16,298 | 4073 | 43.4% | 69.0% | 67.1% | 82.8% | 1.35 | 0.37 |
Data distribution and results of the phenophase detection accuracy obtained in EXP3‐Pheno, per phenophase categories
| Phenophase | Data distribution in the training data set | Data distribution in the test data set | Classification accuracy |
|---|---|---|---|
| 1 | 10.4% | 10.4% | 74.8% |
| 2 | 7.6% | 7.5% | 24.4% |
| 3 | 9.5% | 9.5% | 27.9% |
| 4 | 7.1% | 7.2% | 8.6% |
| 5 | 17.3% | 17.2% | 60.8% |
| 6 | 7.2% | 7.2% | 6.8% |
| 7 | 10.5% | 10.6% | 18.8% |
| 8 | 10.5% | 10.5% | 18.0% |
| 9 | 19.9% | 19.9% | 78.9% |
Human annotated.
Figure 3Fertility receiver operating characteristic (ROC) curves for EXP1‐Fertility, with ResNet50‐Large (A) and ResNet50‐VeryLarge (B). Blue = test set A (Random‐split); orange = test set B (Species‐split); green = test set C (Herbarium‐split); red stars = percentage of fertile specimens correctly detected at a false positive rate of 5%; black stars = percentage of fertile specimens correctly detected at a false positive rate of 1%.
Figure 4Fertility receiver operating characteristic (ROC) curves for EXP1‐Fertility, with ResNet50‐VeryLarge.
Figure 5Flower detection (A) and fruit detection (B) receiver operating characteristic (ROC) curves for EXP2‐Fl.Fr, with ResNet50‐VeryLarge. Blue = test set A (Random‐split); orange = test set B (Species‐split); green = test set C (Herbarium‐split).
Figure 6L1 error cumulative distribution for phenophase detection experiment (EXP3‐Pheno) using the PHENO data set.
Figure 7Row‐wise normalized confusion matrix of phenophase classification experiment (EXP3‐Pheno) using the PHENO data set.
Comparison of model accuracy with human annotations based on the re‐annotation of 100 herbarium specimens of test set A of EXP1‐Fertility.a
| Annotation types | True positive subset accuracy | False positive subset accuracy | True negative subset accuracy | False negative subset accuracy | Overall accuracy on these subsets | Global accuracy on test set A |
|---|---|---|---|---|---|---|
| ResNet50‐VeryLarge | 100.0% | 0.0% | 100.0% | 0.0% | 50.0% | 96.3% |
| Human annotation | 88.0% | 68.0% | 88.0% | 76.0% | 80.0% | 87.8% |
The global accuracy on the whole test set is computed using the average of the accuracy on each subset weighted by their proportion in the whole test set, i.e., 84.1%, 1.7%, 12.2%, and 2.0%, respectively, for the true positive, false positive, true negative, and false negative subsets.
Annotations were made by co‐author P.B.
Figure 8Illustration of some difficult specimens to annotate in EXP1‐Fertility. (A) Fertile specimen of Hymenophyllum hirsutum, wrongly annotated by the ResNet50‐VeryLarge model and the human observer (P.B.). Fertility is expressed by small terminal sori (1–1.5 mm in diameter) at the extremity of the lamina. Red arrows show them on a close‐up of the lamina. (B) Fertile specimen of Cordia fanchoniae, wrongly annotated by the ResNet50‐VeryLarge model and correctly annotated by the human observer (P.B.). Fertility is expressed by a small young infructescence (1.3 cm high, marked by red arrows), just after anthesis and before the development of fruits.
| Institution names | Institution codes |
|---|---|
| Arnold Arboretum, Harvard University | A |
| Bartlett Arboretum | BART |
| Boston University | BSN |
| Brown University | BRU |
| Connecticut Botanical Society | NCBS |
| Central Connecticut State University | CCSU |
| Connecticut College | CCNL |
| University of Connecticut | CONN |
| Economic Herbarium of Oakes Ames, Harvard University | ECON |
| Farlow Herbarium, Harvard University | FH |
| Gray Herbarium, Harvard University | GH |
| Keene State College | KESC |
| New York Botanical Garden | NY |
| New England Botanical Club | NEBC |
| Rutgers University | CHRB |
| University of Maine | MAINE |
| University of Massachusetts | MASS |
| University of New Hampshire | NHA |
| University of Rhode Island | KIRI |
| University of Vermont | VT |
| Western Connecticut State University | WCSU |
| Westfield State University | WSCH |
| Wilton Garden Club | WGCH |
| Yale Peabody Museum of Natural History, Yale University | YU |
All institutions are located in the United States.
| Institution names | Institution codes |
|---|---|
| Archbold Biological Station | ARCH |
| Arizona State University | ASU |
| Ohio University | BHO |
| Centre County Historical Society | CCHS |
| The College of Idaho | CIC |
| Central Michigan University | CMC |
| University of Connecticut | CONN |
| Converse College | CONV |
| Denver Botanic Gardens | DBG |
| Desert Botanical Garden | DES |
| Eastern Illinois University | EIU |
| The Ronald L. Jones Herbarium | EKY |
| Eastern Michigan University | EMC |
| Florida Museum of Natural History | FLAS |
| Florida State University | FSU |
| University of Georgia | GA |
| Gray Herbarium, Harvard University | GH |
| Georgia Southwestern State University | GSW |
| Hope College | HCHM |
| Portland State University | HPSU |
| University of Idaho | ID |
| Idaho State University | IDS |
| University of Illinois | ILL |
| Illinois Natural History Survey, Illinois Dept. of Natural Resources | ILLS |
| Keene State College | KESC |
| University of Rhode Island | KIRI |
| Louisiana State University | LSU |
| University of Michigan | MICH |
| University of Minnesota | MIN |
| University of Mississippi | MISS |
| Mississippi State University | MISSA |
| Missouri Botanical Garden | MO |
| Morton Arboretum, Illinois | MOR |
| Marshall University | MUHW |
| North Carolina State University | NCSC |
| University of North Carolina at Chapel Hill | NCU |
| Natural History Museum of United Kingdom | NHMUK |
| New York Botanical Garden | NY |
| Marie Selby Botanical Gardens | SEL |
| Boise State University | SRP |
| Troy University | TROY |
| Tall Timbers Research Station | TTRS |
| James F. Matthews Center for Biodiversity Studies | UNCC |
| University of New Mexico | UNM |
| Smithsonian Institution | US |
| University of South Carolina | USCH |
| University of South Carolina Upstate | USCS |
| University of South Florida | USF |
| University of Southern Mississippi | USMS |
| Intermountain Herbarium, Utah State University | USU |
| University of West Florida | UWFP |
| University of Wisconsin–La Crosse | UWL |
| University of Wisconsin–Milwaukee | UWM |
| Valdosta State University | VSC |
| Western Carolina University | WCUH |
| Whitman College | WCW |
| University of Wisconsin–Madison | WIS |
All institutions are located in the United States.
Values of the initial learning rate, the batch size, and the number of epochs for each experiment
| Experiment | Initial learning rate | Batch size | No. of epochs |
|---|---|---|---|
|
EXP1‐Fertility | 0.001 | 48 | 45 |
|
EXP1‐Fertility | 0.001 | 12 | 45 |
|
EXP2‐Fl.Fr | 0.01 | 12 | 45 |
|
EXP3‐Pheno | 0.001 | 8 | 30 |