| Literature DB >> 29048559 |
Fernando Perez-Sanz1, Pedro J Navarro2, Marcos Egea-Cortines1.
Abstract
The study of phenomes or phenomics has been a central part of biology. The field of automatic phenotype acquisition technologies based on images has seen an important advance in the last years. As with other high-throughput technologies, it addresses a common set of problems, including data acquisition and analysis. In this review, we give an overview of the main systems developed to acquire images. We give an in-depth analysis of image processing with its major issues and the algorithms that are being used or emerging as useful to obtain data out of images in an automatic fashion.Entities:
Keywords: Algorithms; artificial vision; deep learning; hyperspectral cameras; machine learning; segmentation
Mesh:
Year: 2017 PMID: 29048559 PMCID: PMC5737281 DOI: 10.1093/gigascience/gix092
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1:Basic workflow in computer vision–based plant phenotyping.
List of software tools for image processing
| Vision libraries | Source | Language |
|---|---|---|
| OpenCV |
| C++, Python, Java, C# |
| EmguCV |
| |
| PlantCV |
| Python |
| Scikit-image |
| |
| Bioimagetools, bayesimages, edci, DRIP, dpmixsim, raster, … |
| R |
| Cimg |
| C++ |
| Simplecv |
| |
| Fastcv |
| |
| Ccv |
| |
| Vxl |
| |
| BoofCV |
| Java |
| OpenIMAJ |
| |
| JavaCV |
|
Figure 2:An overview of different spectra used for phenotyping and the associated cameras. The names of different indexes are found in Table 2.
A list of indexes, the corresponding wavelength ranges, and their use to analyse plant material
| Index | Range, nm | Applications |
|---|---|---|
| CAI—Cellulose Absorption Index | 2200–2000 | Quantification of mixed soil–plant litter scenes [ |
| LCA—Lignin-Cellulose Absorption Index | 2365–2145 | Measure of the effects of soil composition and mineralogy of crop residue cover [ |
| NTDI—Normalized Difference Tillage Index | 2359–1150 | Used for identifying crop residue cover in conventional and conservation tillage systems [ |
| LWVI-1 – Normalized Difference Leaf water VI 2 | 1094–893 | Discrimination of sugarcane varieties, allowed to detect large amounts of non-photosynthetically active constituents within the canopy [ |
| DLAI—Difference Leaf Area Index | 1725–970 | Used for estimating leaf area index based on the radiation measurements in the visible and near-infrared [ |
| PWI—Plant Water Index | 970–902 | Water content estimation and study of the characteristics of canopy spectrum and growth status [ |
| NLI—Nonlinear Vegetation Index | 1400–780 | Measurement of plant leaf water content; in combination with others, indexes can detect interaction of biochemicals such as protein, nitrogen, lignin, cellulose, sugar, and starch [ |
| DWSI—Disease Water Stress Index | 1657–547 | To predict larval mosquito presence in wetland [ |
| NDVI—Normalized Difference Vegetation Index | 800–670 | Measurement of significant variations in photosynthetic activity and growing season length at different latitudes [ |
| MCARI—Modified Chlorophyll Absorption Ratio Index | 700–670 | Study of vegetation biophysical parameters, as well as external factors affecting canopy reflectance [ |
| GI—Greenness Index | 670–550 | Characterization of corn nitrogen status [ |
| CAR—Chlorophyll Absorption Ratio | 700–500 | Estimating the concentration of individual photosynthetic pigments within vegetation [ |
| GNDVI—Green Normalized Difference Vegetation Index | 800–550 | Providing important information for site-specific agricultural decision-making [ |
| OSAVI—Optimized Soil Adjusted Vegetation Index | 800–670 | Measurement of highly sensitive chlorophyll content variations that are very resistant to the variations of LAI and solar zenith angle [ |
| CI r—Coloration Index red | 780–710 | Mapping of coastal dune and salt marsh ecosystems [ |
| CI g—Coloration Index green | 780–550 | Characterization of the state of soil degradation by erosion [ |
List of machine learning software libraries and their languages
| Libraries ML/DL | Source | Language |
|---|---|---|
| MICE, rpart, Party, CARET, randomForest, nnet, e1071, KernLab, igraph, glmnet, ROCR, tree, Rweka, earth, klaR, |
| R |
| Scikit-learn |
| Python |
| Tensorflow |
| |
| Theano |
| |
| Pylearn2, |
| |
| NuPIC |
| |
| Caffe |
| |
| PyBrain |
| |
| Weka |
| Java |
| Spark |
| |
| Mallet |
| |
| JSAT |
| |
| ELKI |
| |
| Java-ML |
| |
| Accord |
| C#, C++, C |
| Multiboost |
| |
| Shogun |
| |
| LibSVM |
| |
| mlpack |
| |
| Shark |
| |
| MLC++ |
|
A list of current procedures for image analysis based on the type of sensor used
| Data type/source | Preprocessing | Segmentation | Feature extraction | Machine learning |
|---|---|---|---|---|
| Mono—RGB | *Homomorphic filtering to minimize illumination issues in outdoor images [ | *Many vegetation indexes apply to segmentation in [ | *Fourier descriptors and Zernike moments [ | *ANN to detect Phalaenopsis seedling diseases [ |
| *Filtering and histogram equalization in plant disease detection [ | *NDVI index to discriminate background and foreground [ | *Statistical parameters and Wavelet transform with geometric characteristics [ | *SVM to detect tomato leaf viruses [ | |
| *Cellular neural networks edge detection [ | *SIFT and SURF in 3D reconstruction images from multiple RGB cameras with basil specimen [ | *Gaussian mixture model to detect biotic stress in wheat [ | ||
| *HSV algorithm [ | *Histogram to color features and Fast Fourier Transform + Discrete Wavelet Transform to texture features extraction [ | *k-NN to identify leaf disease [ | ||
| *Probabilistic Neural Networks and Genetic Algorithm [ | ||||
| *Random forest to QTL analysis [ | ||||
| StereoVision | *Complete and general preprocessing pipeline [ | *Otsu's method & growing region [ | *Graph-cut and local correlation [ | *SVM to identify diseased pixels in leaves [ |
| *Rectification of image based on SIFT and epipolar transformation, in vitis vinifera segmentation [ | *SVM to remove background [ | *SURF to stereo-match images based on their feature vectors [ | *SVM & Gaussian Processes Classifier to detect soil moisture deficit [ | |
| *Camera stereo calibration, leaf quantifying | *Combined with thermal images (global and local features (temperature, depth, color) using PCA and analysis of variance [ | |||
| *RGB2GrayScale [ | *Simple statistical and intensity values [ | |||
| *Align and depth estimation [ | ||||
| Multi-Hyper spectral | *Savitzky-Golay filter: remove noise and smooth the image [ | *NDVI (750–705/750+705) nm with threshold of 0.20 [ | *Pixels averaged to obtain average reflectance [ | *Cascade of data mining techniques to detect foliar disease in barley leaves [ |
| *Gaussian filter to remove noise: detection of disease in banana leaves [ | *Bayes, logistic, random forest, and decision trees to detect biotic stress in Alternaria genus [ | |||
| *Savitzky-Golay filter: detection of disease in plants [ | *k-NN to identify leaf disease [ | |||
| *PCA and partial least squares regression to predict water, macronutrient, and micronutrient concentrations [ | ||||
| ToF | *Correction of the distance error caused by the extra contribution of electrons from sunlight using an offset parameter [ | *Combine hierarchical color segmentation with quadratic surface fitting using ToF depth data [ | *SIFT, Hough Transform, and RANSAC algorithms to extract relevant features [ | |
|
|
| |||
| *Removal of spurious individual points (outliers) using statistical filter [ | *Removal of background by simple thresholding pixel values greater than a certain threshold [ | |||
| *Removal of lens distortion [ | *Segmentation inspired from the maximally stable extremal regions algorithm [ | |||
| LIDAR | *RANSAC algorithm to detect ground plane [ | *Clustering to detect individual plants [ | *Statistical features from reflectace and geometry [ | *ANN for Wheat Green Area Index measurement [ |
| *Reduction of noise, filtering point clouds based on deviation [ |
| *ANN, SVM, logistic regression for plant identification (the best results) [ | ||
| *Generalized linear model (the best) to model plant richness [ | ||||
| *SVM obtained a highly reliable classification of about 96% [ | ||||
| Thermography/Flourescence | *Align with stereo images (in combination with stereo images) [ | *Semi-automated segmentation through a geometric algorithm implemented in Python-based software ChopIt [ | *Combined with thermal images (global and local features: temperature, depth, color) using PCA and analysis of variance [ | *SVM to identify diseased pixels in leaves [ |
| *Normalize thermal information with thermal indexes [ | *Manual thresholding comparing conventional color images with fluorescent images (Fv/Fm) [ | *SVM and Gaussian Processes Classifier to detect soil moisture deficit [ | ||
| *Trimming extraneous images from image stack [ | *Analysis of variance (not ML) to analyze different water status [ | |||
| *ANN and SVM to detect zinc deficiency stress using fluorescence imaging [ | ||||
| MRI/Tomography | *2D and 3D Fourier transformations (MRI) [ | *Yang 2011: watershed segmentation [ | *Intensity features, Haralick textural features, intensity local binary pattern features, contrast features, and Gabor intensity textural features [ | *Supervised learning with ANN, Mahalanobis distance, linear discriminant analysis, and quadratic discriminant analysis to determine boundary lines [ |
| *Median filter, binaryzation, fill holes, remove small particles, and morphological filter (erosion) [ | *Histogram thresholding method to binaryze the image [ | |||
| *Re-slicing, cropping, and contrast enhancement [ |