| Literature DB >> 29962832 |
Jana Wäldchen1, Patrick Mäder2.
Abstract
Species knowledge is essential for protecting biodiversity. The identification of plants by conventional keys is complex, time consuming, and due to the use of specific botanical terms frustrating for non-experts. This creates a hard to overcome hurdle for novices interested in acquiring species knowledge. Today, there is an increasing interest in automating the process of species identification. The availability and ubiquity of relevant technologies, such as, digital cameras and mobile devices, the remote access to databases, new techniques in image processing and pattern recognition let the idea of automated species identification become reality. This paper is the first systematic literature review with the aim of a thorough analysis and comparison of primary studies on computer vision approaches for plant species identification. We identified 120 peer-reviewed studies, selected through a multi-stage process, published in the last 10 years (2005-2015). After a careful analysis of these studies, we describe the applied methods categorized according to the studied plant organ, and the studied features, i.e., shape, texture, color, margin, and vein structure. Furthermore, we compare methods based on classification accuracy achieved on publicly available datasets. Our results are relevant to researches in ecology as well as computer vision for their ongoing research. The systematic and concise overview will also be helpful for beginners in those research fields, as they can use the comparable analyses of applied methods as a guide in this complex activity.Entities:
Year: 2017 PMID: 29962832 PMCID: PMC6003396 DOI: 10.1007/s11831-016-9206-z
Source DB: PubMed Journal: Arch Comput Methods Eng ISSN: 1134-3060 Impact factor: 7.302
Fig. 1Generic steps of an image-based plant classification process (green-shaded boxes are the main focus of this review). (Color figure online)
Fig. 2Study selection process
Seeding set of papers for the backward and forward snowballing
| Study | Journal | Topic | Year |
|
|
|---|---|---|---|---|---|
| Gaston and O'Neill [ | Philosophical Transactions of the Royal Society of London | Roadmap paper on automated species identification | 2004 | 91 | 215 |
| MacLeod et al. [ | Nature | Roadmap paper on automated species identification | 2010 | 10 | 104 |
| Cope et al. [ | Expert Systems with Applications | Review paper on automated leaf identification | 2012 | 113 | 108 |
| Nilsback et al. [ | Indian Conference on Computer Vision, Graphics and Image Processing | Study paper on automated flower recognition | 2008 | 18 | 375 |
| Du et al. [ | Applied Mathematics and Computation | Study paper on automated leaf recognition | 2007 | 20 | 215 |
Table notes: Number of citations based on Google Scholar, accessed June 2016
Simplified overview of the data extraction template
|
| |
| Study identifier | |
| Year of publication | [2005–2015] |
| Country of all author(s) | |
| Authors’ background | [Biology/Ecology, Computer science/Engineering, Education] |
| Publication type | [journal, conference proceedings] |
|
| |
| Depicted organ(s) | [leaf, flower, fruit, stem, whole plant] |
| No. of species | |
| No. of images | |
| Image source(s) | [own dataset, existing dataset] |
| (a) own dataset | [fresh material, herbarium specimen, web] |
| (b) existing dataset | [name, no. of species, no. of images, source] |
| Image type | [photo, scan, pseudo-scan] |
| Image background | [natural, plain] |
| Considering: | |
| (a) damaged leaves | [yes, no] |
| (b) overlapped leaves | [yes, no] |
| (c) compound leaves | [yes, no] |
|
| |
| Studied organ | [leaf, flower, fruit, stem, whole plant] |
| Studied feature(s) | [shape, color, texture, margin, vein] |
| Studied descriptor(s) | |
|
| |
| Utilized dataset | |
| No. of species | |
| Studied feature(s) | |
| Applied classifier | |
| Achieved accuracy | |
|
| |
| Prototype name | |
| Type of application | [mobile, web, desktop] |
| Computation | [online, offline] |
| Publicly available | [yes, no] |
| Supported organ | [leaf, flower, multi-organ] |
| Expected background | [plain, natural] |
aMultiple values possible
Fig. 3Number of studies per year of publication
Fig. 4Leaf structure, leaf types, and flower structure
Overview of utilized image data
| Organ | Background | Image category | Studies |
|
|---|---|---|---|---|
| Leaf | Plain | Scans | [ | 23 |
| Pseudo-scans | [ | 19 | ||
| Scans + pseudo-scans | [ | 43 | ||
| Illustrated leaf images | [ | 4 | ||
| [No information] | [ | 6 | ||
| Natural | Photos | [ | 3 | |
| Plain + natural | Scans + pseudo-scans + photos | [ | 9 | |
| Flower | Natural | Photos | [ | 14 |
| Stem, fruit, full plant | Natural | Photos | [ | 1 |
Overview of utilized image datasets
| Organ | Dataset | Studies |
| |
|---|---|---|---|---|
| Leaf | Own dataset | Self-collected (imaged in lab) | [ | 46 |
| Web | [ | 2 | ||
| Existing dataset | ImageCLEF11/ImageCLEF12 | [ | 20 | |
| Swedish leaf | [ | 12 | ||
| ICL | [ | 12 | ||
| Flavia | [ | 19 | ||
| Leafsnap | [ | 6 | ||
| FCA | [ | 1 | ||
| Korea Plant Picture Book | [ | 2 | ||
| Middle European Woody Plants (MEW) | [ | 1 | ||
| Southern China Botanical Garden | [ | 1 | ||
| Tela Database | [ | 1 | ||
| [No information] | [ | 6 | ||
| Flower | Own dataset | Self-collected (imaged in field) | [ | 1 |
| Self-collected (imaged in field) + web | [ | 6 | ||
| Existing dataset | Oxford 17, Oxford 102 | [ | 3 | |
| [No information] | [ | 4 | ||
| Flower, leaf, bark, fruit, full plant | Existing dataset | Social image collection | [ | 1 |
Fig. 5Distribution of the maximum evaluated species number per study. Six studies [76, 100, 101, 107, 108, 112] provide no information about the number of studied species. If more than one dataset per paper was used, species numbers refer to the largest dataset evaluated
Fig. 6Distribution of the maximum evaluated images number per study. Six studies [10, 53, 76, 118, 132, 135] provide no information about the number of used images. If more than one dataset per paper was used, image numbers refer to the largest dataset evaluated
Studied organs and features
| Organ | Feature | Studies |
|
|---|---|---|---|
| Leaf | Shape | [ | 56 |
| Texture | [ | 12 | |
| Margin | [ | 2 | |
| Vein | [ | 4 | |
| Shape + texture | [ | 9 | |
| Shape + color | [ | 4 | |
| Shape + margin | [ | 6 | |
| Shape + vein | [ | 10 | |
| Shape + color + texture | [ | 2 | |
| Shape + color + texture + vein | [ | 2 | |
| Flower | Shape | [ | 3 |
| Shape + color | [ | 5 | |
| Shape + texture | [ | 1 | |
| Shape + texture + color | [ | 5 | |
| Bark + fruit | Shape + texture | [ | 1 |
| Full plant | Shape + texture + color | [ | 1 |
Fig. 7Categorization (green shaded boxes) and overview (green framed boxes) of the most prominent feature descriptors in plant species identification. Feature descriptors partly fall in multiple categories. (Color figure online)
Studies analyzing the shape of organs solely or in combination with other features
| Organ | Features | Shape descriptor | Studies |
|---|---|---|---|
| Leaf |
| SMSD | [ |
| SMSD, FD | [ | ||
| SMSD, moments (Hu) | [ | ||
| SMSD, moments (TMI), FD | [ | ||
| SMSD, moments (Hu, ZMI), FD | [ | ||
| SMSD, DFH | [ | ||
| Moments (Hu) | [ | ||
| Moments (Hu, ZMI) | [ | ||
| Moments (ZMI, LMI, TMI) | [ | ||
| FD | [ | ||
| CCD | [ | ||
| CCD, AC | [ | ||
| AT | [ | ||
| TAR, TSL, TOA, TSLA | [ | ||
| TAR, TSL, SC, salient points description | [ | ||
| CSS | [ | ||
| SMSD, CSS | [ | ||
| CSS, velocity representation | [ | ||
| HoCS | [ | ||
| SRVF | [ | ||
| IDSC | [ | ||
| I-IDSC, Gaussian shape pyramid | [ | ||
| MDM | [ | ||
| SIFT | [ | ||
| HOG | [ | ||
| HOG, central moments of order | [ | ||
| SURF | [ | ||
| Multi-scale overlapped block LBP | [ | ||
| MARCH | [ | ||
| Describe leaf edge variation | [ | ||
| FD, procrustes analysis | [ | ||
| Polygonal approximation, invariant attributes sequence representation | [ | ||
| Minimum perimeter polygons | [ | ||
| HOUGH, Fourier, EOH, LEOH, DFH | [ | ||
| Moments (Hu), centroid-Radii model, binary-Superposition | [ | ||
| Isomap, supervised isomap | [ | ||
| MLLDE algorithm | [ | ||
| MICA | [ | ||
| Parameters of the compound leaf model, parameters of the polygonal leaflet model, averaged parameters of base and apex models, averaged CSS-based contour parameters | [ | ||
| Leaf landmarks (leaf apex, the leaf base, centroid) | [ | ||
| Detecting different leaf parts (local translational symmetry of small regions, local symmetry of depth indention) | [ | ||
| Detecting petitole shape (local translational symmetry of width) | [ | ||
| Geometric properties of local maxima and inflexion points | [ | ||
|
| SMSD | [ | |
| SMSD, FD | [ | ||
| PHOG, Wavelet features | [ | ||
| SIFT | [ | ||
|
| CSS, detecting teeth and pits | [ | |
| SMSD, moments (Hu), MDM, AMD | [ | ||
| Advanced SC | [ | ||
|
| SMSD, moments (Hu) | [ | |
| CCD, AC | [ | ||
| CT, moments (Hu) | [ | ||
| Multi-resolution and multi-directional CT | [ | ||
| RSC | [ | ||
| CDS | [ | ||
| SC | [ | ||
| Advanced SC | [ | ||
| DS-LBP | [ | ||
| SURF, EOH, HOUGH | [ | ||
|
| SMSD | [ | |
| RPWFF, FracDim, moments (Hu) | [ | ||
| FracDim | [ | ||
| SC, SIFT | [ | ||
| Minimum perimeter polygons | [ | ||
| Contour covariance | [ | ||
|
| SIFT, high curvature points on the contour | [ | |
| SMSD, BSS, RMI, ACH, CPDH, FD | [ | ||
|
| SMSD | [ | |
| Flower |
| SMSD | [ |
| Mathematical descriptor for petal shape | [ | ||
| Zero-crossing rate, the minimum distance, contour line’s length from the contour image | [ | ||
|
| Shape density distribution, edge density distribution | [ | |
| SMSD, moments (Hu), FracDim, CCD | [ | ||
| SIFT, Dense SIFT, feature context | [ | ||
| CDD | [ | ||
| CDS, SMSD | [ | ||
|
| SIFT | [ | |
|
| Edge densities, edge directions, moments (Hu) | [ | |
| CSS | [ | ||
| SIFT | [ | ||
| SIFT, HOG | [ | ||
| SURF, EOH, HOUGH | [ | ||
| Fruit, bark, full plant |
| SURF, EOH, HOUGH | [ |
Abbreviations not explained in the text–BSS basic shape statistics, CPDH contour point distribution histogram, CT curvelet transform, EOH edge orientation histogram, DFH directional fragment histogram, DS-LBP dual-scale decomposition and local binary descriptors, Fourier Fourier histogram, HOUGH histogram of lines orientation and position, LEOH local edge orientation histogram, MICA multilinear independent component analysis, MLLDE modified locally linear discriminant embedding, PHOG pyramid histograms of oriented gradients, RMI regional moments of inertia, RPWFF ring projection wavelet fractal feature, RSC relative sub-image coefficients
Simple and morphological shape descriptors (SMSD)
| Descriptor | Explanation | Pictogram | Formula | Studies |
|
|---|---|---|---|---|---|
|
| Longest distance between any two points on the margin of the organ |
| [ | 6 | |
|
| Line segment connecting the base and the tip of the leaf |
| [ | 6 | |
|
| Maximum width that is perpendicular to the |
| [ | 6 | |
|
| Number of pixels in the region of the organ |
| [ | 8 | |
|
| Summation of the distances between each adjoining pair of pixels around the border of the organ |
| [ | 8 | |
|
| Represents the coordinates of the organ’s geometric center |
| [ | 2 | |
|
| Ratio of |
|
| [ | 19 |
|
| Illustrates the difference between a organ and a circle |
|
| [ | 16 |
|
| Ratio of the |
|
| [ | 3 |
|
| Represents how rectangle a shape is, i.e., how much it fills its minimum bounding rectangle |
|
| [ | 10 |
|
| Ratio of the distance between the foci of the ellipse ( |
| [ | 5 | |
|
| Ratio of the |
|
| [ | 5 |
|
| Ratio of the |
|
| [ | 3 |
|
| Ratio of the |
|
| [ | 3 |
|
| Ratio of object |
|
| [ | 5 |
|
| The convex hull of a region is the smallest region that satisfies two conditions: (1) it is convex, and (2) it contains the organ’s region |
| [ | 4 | |
|
| Ratio of the |
|
| [ | 4 |
|
| Normalized difference of the |
|
| [ | 1 |
|
| Ratio between organ’s |
|
| [ | 6 |
|
| Ratio of the radius of the inside circle of the bounding box ( |
|
| [ | 4 |
|
| Diameter of a circle with the same area as the organ’s |
|
| [ | 1 |
|
| Represents the mapping error of a shape to fit an ellipse with same covariance matrix as the shape |
| [ | 2 | |
|
| Ratio between organ’s |
| [ | 2 | |
|
| The leaf is sliced, perpendicular to the major axis, into a number of vertical strips. Then for each strip ( |
|
| [ | 1 |
|
| The leaf is sliced, perpendicular to the major axis, into a number of vertical strips. Then for each strip ( |
|
| [ | 1 |
|
| Portion of cracks in leaf image; |
| [ | 1 |
Studies analyzing the color of organs in combination with other features
| Organ | Feature | Color descriptor | Studies |
|---|---|---|---|
| Leaf | Shape, | CM, CH | [ |
| CM | [ | ||
| Shape, | CH, CCM | [ | |
| CM, CH | [ | ||
| Shape, | CM | [ | |
| Flower |
| CH | [ |
| CSIFT | [ | ||
| Shape, | CH | [ | |
| Full plant | Shape, | CH | [ |
Studies analyzing the texture of organs solely or in combination with other features
| Organ | Feature | Texture descriptor | Studies |
|---|---|---|---|
| Leaf |
| GF | [ |
| GF, GLCM | [ | ||
| LGPQ | [ | ||
| FracDim | [ | ||
| CT | [ | ||
| EAGLE, SURF | [ | ||
| [No information] | [ | ||
| Shape, | DWT | [ | |
| EOH | [ | ||
| Fourier, EOH | [ | ||
| RSC | [ | ||
| DS-LBP | [ | ||
| EnS | [ | ||
| GF, GLCM | [ | ||
| Gradient histogram | [ | ||
| Shape, color, | GF | [ | |
| EOH, GF | [ | ||
| Shape, color, | GIH, GLCM | [ | |
| GLCM | [ | ||
| Flower | Shape, | SFTA | [ |
| Shape, color, | Statistical attributes (mean, sd) | [ | |
| EOH | [ | ||
| Fourier, EOH | [ | ||
| Leung-Malik filter bank | [ | ||
| Fruit, bark | Shape, | Fourier, EOH | [ |
| Full plant | Shape, color, | Fourier, EOH | [ |
Abbreviations not explained in the text—CT curvelet transform, DWT discrete wavelet transform, EnS entropy sequence, Fourier Fourier histogram, RSC relative sub-image coefficients
Studies analyzing leaf-specific features either solely or in combination with other leaf features
| Organ | Feature | Leaf-specific descriptor | Studies |
|---|---|---|---|
| Leaf |
| Run-length features | [ |
| Leaf vein and areoles morphology | [ | ||
| Shape, | Graph representations of veins | [ | |
|
| [ | ||
| Calculating the density of end points and branch points | [ | ||
| FracDim | [ | ||
| SC,SIFT | [ | ||
| Extended circular covariance histogram | [ | ||
| Color, shape, texture, |
| [ | |
|
| Margin signature | [ | |
| Leaf tooth features (total number of leaf teeth, ratio between the number of leaf teeth and the length of the leaf margin expressed in pixels, leaf-sharpness and leaf-obliqueness) | [ | ||
| SC-based descriptors: leaf contour, spatial correlation between salient points of the leaf and its margin | [ | ||
| Shape | CSS | [ | |
| Sequence representation of leaf margins where teeth are viewed as symbols of a multivariate real valued alphabet | [ | ||
| Morphological properties of margin shape (13 attributes) | [ | ||
| Margin statistics (average peak height, peak height variance, average peak distance and peak distance variance) | [ |
Comparison of classification accuracy on the Swedish leaf dataset containing twelve species
| Descriptor | Feature | Classifier | Accuracy | Studies |
|---|---|---|---|---|
| GF | Texture | Fuzzy k-NN | 85.75 | [ |
| FD | Shape | 1-NN | 87.54 | [ |
| SC | Shape | k-NN | 88.12 | [ |
| FD | Shape | k-NN | 89.60 (83.60) | [ |
| HoCS | shape | Fuzzy k-NN | 89.35 | [ |
| TAR | Shape | k-NN | 90.40 | [ |
| HOG | Shape | 1-NN | 93.17 (92.98) | [ |
| MDM–ID | Shape | k-NN | 93.60 (90.80) | [ |
| IDSC | Shape | 1-NN | 93.73 (85.07) | [ |
| IDSC | Shape | SVM | 93.73 | [ |
| IDSC | Shape | k-NN | 94.13 (85.07) | [ |
| TOA | Shape | k-NN | 95.20 | [ |
| TSL | Shape | k-NN | 95.73 | [ |
| TSLA | Shape | k-NN | 96.53 | [ |
| LBP | Shape | SVM | 96.67 | [ |
| I-IDSC | Shape | 1-NN | 97.07 | [ |
| MARCH | Shape | 1-NN | 97.33 | [ |
| DS-LBP | Shape + texture | Fuzzy k-NN | 99.25 | [ |
The original images of the Swedish leaf dataset contain leafstalks. Numbers in brackets are results obtained after removing leafstalks
Comparison of classification accuracies on the ICL dataset (220 species) and its two subsets (50 species each)
| Descriptor | Feature | Classifier | Accuracy | Studies |
|---|---|---|---|---|
|
| ||||
| FD | Shape | 1-NN | 60.08 | [ |
| TAR | Shape | 1-NN | 78.25 | [ |
| IDCS | Shape | 1-NN | 81.39 | [ |
| IDSC | Shape | k-nn | 83.79 | [ |
| GF | Texture | Fuzzy k-NN | 84.60 | [ |
| MARCH | Shape | 1-NN | 86.03 | [ |
| HoCS | Shape | Fuzzy k-NN | 86.27 | [ |
| MDM | Shape | Fuzzy k-NN | 88.24 | [ |
| IDSC | Shape | Fuzzy k-NN | 90.75 | [ |
| SIFT, SC | Shape + vein | k-NN | 91.30 | [ |
| EnS and CDS | Shape + texture | SVM | 95.87 | [ |
| DS-LBP | Shape + texture | Fuzzy k-NN | 98.00 | [ |
|
| ||||
| IDSC | Shape | SVM | 95.79 (63.99) | [ |
| FD | Shape | 1-NN | 96.00 (80.88) | [ |
| HOG | Shape | SVM | 96.63 (83.35) | [ |
| LBP | Shape | SVM | 97.70 (92.80) | [ |
| IDSC | Shape | 1-NN | 98.00 (66.64) | [ |
| MDM with ID | Shape | 1-NN | 98.20 (80.80) | [ |
| HOG | Shape | 1-NN | 98.92 (89.40) | [ |
| I-IDSC | Shape | 1-NN | 99.48 (88.40) | [ |
Certain studies used two subsets of the ICL leaf dataset: subset A and subset B (in brakets). Subset A includes 50 species with shapes easily distinguishable by humans. Subset B includes 50 species with very similar but still visually distinguishable shapes
Comparison of classification accuracies on the FLAVIA dataset with 32 species
| Descriptor | Feature | Classifier | Accuracy | Study |
|---|---|---|---|---|
| Hu moments | Shape | SVM | 25.30 | [ |
| HOG | Shape |
| ||
| SIFT | Shape | 87.50 | [ | |
| SMSD, | Shape + vein | PNN | 90.31 | [ |
| SMSD | Shape | 70.09 | ||
| PFT | Shape | k-NN | 76.69 | [ |
| SMSD, FD | Shape | 84.45 | ||
| SMSD, FD, CM | Color + shape | k-NN, DT |
| |
| SMSD | Shape | PNN | 91.40 | [ |
| SMSD, | Shape + vein | SVM (k-NN) |
| [ |
| SIFT | Shape | SVM | 95.47 | [ |
| SURF | Shape | SVM | 95.94 | [ |
| SMSD, FD | Shape | BPNN | 96.00 | [ |
| SMSD, CM, GLCM, | Shape + color + texture + vein | SVM | 96.25 | [ |
| SMSD | Shape | 87.61 (82.34, 80.26, 72.89) | [ | |
| SMSD, CM | Shape + color | RF (k-NN, NB, SVM) | 93.95 (92.46, 88.77, 86.50) | |
| SMSD, CM, CH | Shape + color |
| ||
| SMSD | Shape | NFC | 97.50 | [ |
| CT, Hu moments | Shape | 50.16 (41.60) | [ | |
| GF, GLCM | Texture | NFC (MLP) | 81.60 (87.10) | |
| CT, Hu moments, GF, GLCM | Shape + texture |
| ||
| EnS and CDS | Shape + texture | SVM |
| [ |
Prototypical applications implementing proposed approaches
| Name | Application type | Organ | Background | Analysis | URL | Studies |
|---|---|---|---|---|---|---|
| LeafView | Mobile (Tablet PC) | Single leaf | Plain | Offline | [ | |
| LeafSnap | Mobile (iOS) | Single leaf | Plain | Online |
| [ |
| FOLIA | Mobile (iOS) | Single leaf | Natural | Online |
| [ |
| ApLeafis | Mobile (Android) | Single leaf | Plain | Offline | [ | |
| – | Mobile (Android) | Single leaf | Plain | Online | [ | |
| – | Mobile (Android) | Single leaf | Plain | Offline | [ | |
| – | Mobile (Android) | Single leaf | Plain | Offline/online | [ | |
| – | Mobile (Android) | Single leaf | Plain | Online | [ | |
| – | Mobile (iOS) + web | Single leaf | Plain | Offline/online | [ | |
| CLOVER | Mobile (PDA) | Single leaf | Plain | Online | [ | |
| MOSIR | Mobile | Flower | Natural | Online | [ | |
| Leaves Lite | Web | Single leaf | Plain | Online | [ | |
| Pl@ntNet-Identify | Web | Multi organ | Plain | Online |
| [ |
| Chloris | Desktop | Single leaf | Plain | Offline | [ | |
| Leaf recognition | Desktop | Single leaf | Plain | Offline | [ | |
| – | Desktop | Single leaf | Natural | Offline | [ | |
| Flower recognition system | Desktop | Flower | Natural | Offline | [ |