| Literature DB >> 35141328 |
Ankita Gupta1, Lakhwinder Kaur1, Gurmeet Kaur2.
Abstract
Phenomics and chlorophyll fluorescence can help us to understand the various stresses a plant may undergo. In this research work, we observe the image-based morphological changes in the wheat canopy. These changes are monitored by capturing the maximum area of wheat canopy image that has maximum photosynthetic activity (chlorophyll fluorescence signals). The proposed algorithm presented here has three stages: (i) first, derivation of dynamic threshold value by curve fitting of data to eliminate the pixels of low-intensity value, (ii) second, extraction and segmentation of thresholded region by application of histogram-based K-means algorithm iteratively (this scheme of the algorithm is referred to as the curve fit K-means (CfitK-means) algorithm); and (iii) third, computation of 23 grey level cooccurrence matrix (GLCM) texture features (traits) from the wheat images has been done. These features help to do statistical analysis and infer agronomical insights. The analysis consists of correlation, factor, and agglomerative clustering to identify water stress indicators. A public repository of wheat canopy images was used that had normal and water stress response chlorophyll fluorescence images. The analysis of the feature dataset shows that all 23 features are proved fruitful in studying the changes in the shape and structure of wheat canopy due to water stress. The best segmentation algorithm was confirmed by doing exhaustive comparisons of seven segmentation algorithms. The comparisons showed that the best algorithm is CfitK-means as it has a maximum IoU score value of 95.75.Entities:
Year: 2022 PMID: 35141328 PMCID: PMC8820929 DOI: 10.1155/2022/1875013
Source DB: PubMed Journal: Int J Genomics ISSN: 2314-436X Impact factor: 2.326
Figure 1Flow chart of study methodology.
Figure 2Output of curve fitting mathematical function.
Figure 3Segmentation algorithm used: none (input image from which wheat canopy will be extracted); description: original image to be segmented.
Figure 4Segmentation algorithm used: Global Static Thresholding (fixed value); description: some pixel values lost membership in the final segmented image.
Figure 5Segmentation algorithm used: Global Automatic Thresholding (Otsu); description: pixel membership loss is there but less than that of the static thresholding method.
Figure 6Segmentation algorithm used: K-means based on 4 mean values (Otsu); description: pixel membership loss is there but less than that of the static thresholding method.
Figure 7Segmentation algorithm used: watershed; description: the watershed algorithm failed to identify the boundaries properly.
Figure 8Segmentation algorithm used: convolution gradient filters (Sobel, Prewitt, Canny); description: nonsmooth edges for all the three operators.
Figure 9Segmentation algorithm used: CfitK-means; description: best results.
Performance evaluation of segmentation algorithms in terms of IoU (intersection over union) score.
| Algorithm | Sample size | Average IoU score | |||
|---|---|---|---|---|---|
| 25 | 50 | 75 | 100 | ||
| Global Static Thresholding (GST) | 20 | 40 | 67 | 88 | 84.3 |
| Global Automatic Thresholding (GAT) | 19 | 33 | 60 | 76 | 74.5 |
|
| 22 | 46 | 70 | 92 | 91.3 |
| CFitK-means | 24 | 48 | 72 | 95 | 95.75 |
Figure 10IoU score comparison for performance.
Description of glcm metrics that define wheat canopy morphology.
| GLCM features | Formulae used | Agronomical implications |
|---|---|---|
| Autocorrelation (autoc): [ |
| It helps to find how consistent the pattern is in an image matrix in terms of coarseness. Its value ranges from 0 to 1, and 1 conveys maximum coarseness. If there is high coarseness this means the canopy structure is having less density. |
|
| ||
| Contrast: (contr) [ |
| This indicates the difference between the highest and lowest intensity pixels in an image. High contrast means that there is a huge difference between different parts of the object which will aid as a useful tool for canopy segmentation. |
|
| ||
| Correlation (corrm): [ |
| Measures the joint probability of pairs of pixels under observation. This means the pixels within the structure of the canopy have some kind of association with each other in case there is some kind of correlation. |
|
| ||
| Correlation (corrp): [ |
| Measures the association of pairs of pixels under observation. |
|
| ||
| Cluster prominence: (cprom) [ |
| Helps to measure the symmetry in the image matrix. If there is a high level of asymmetry in the canopy, it may be an indication of a shunted growth in the plant due to stress. |
|
| ||
| Cluster shade (cshad) [ |
| Helps to measure skewness which is an indication of asymmetry. A high level of asymmetry implies that there is some problem in the growth of the plant due to which symmetrical canopy is not there. |
|
| ||
| Dissimilarity (Dissi) [ |
| Measure the mean difference between the pixels. It helps infer the level of similarity and homogeneity of the canopy structure. |
|
| ||
| Energy: (Energ) [ |
| Helps to measure the uniformity in the image matrix. There are no significant changes in the canopy morphology. |
|
| ||
| Entropy: (Entro) [ |
| Helps to measure the degree of randomness. High randomness implies that there are a lot of changes occurring in the canopy structure. |
|
| ||
| Homogeneity: Homom (inverse difference moment) [ |
| Measures homogeneity at the local level. There are not many changes occurring in the canopy due to any stimuli. |
|
| ||
| Homogeneity: (Homop) inverse difference [ |
| Measures the closeness of the pixels and the similarity between them. |
|
| ||
| Maximum probability (Maxpr) [ |
| Maxpr is a glcm metric that gives the max probability of finding pixels of interest for finding textures. |
|
| ||
| Sum of squares (sosvh): [ |
| Helps to measure the mean/average shift between the pixels. High variance means there are a lot of changes occurring in the morphological structure. |
|
| ||
| Sum average (savgh) [ |
| Helps to measure the average distribution of the gray levels. A high level of distribution of grey level means that the canopy is undergoing a high level of morphological changes. |
|
| ||
| Sum variance (svarh) [ |
| Helps to measure pixel distributions in terms of dispersion. |
|
| ||
| Sum entropy (senth) [ |
| Helps to measures the disorder related to the gray level sum distribution of the image |
|
| ||
| Difference variance (Dvarh) [ |
| It helps to measure the heterogeneity level in the image. A low level of heterogeneity means images having similar patterns of pixels. |
|
| ||
| Difference entropy (Denth) [ |
| It measures the disorder related to the gray level difference distribution of the image |
| Information measure of correlation1 (inf1h) [ |
| Helps to measure the joint probability in terms of correlation and information it contains. A high value of information measure of correlation means the pixels are highly related to each other in terms of pattern. |
|
| ||
| Information measure of correlation2 (inf2h) [ |
| Helps to measure dependency between the pixels. Higher dependency between pixels implies that if one of the pixels changes it will impact the other pixel and would bring significant changes in the texture of the canopy. |
|
| ||
| Inverse difference (INV) is homom (Homom.1) [ |
| Helps to measure local homogeneity. Indirectly help to monitor the canopy structure. |
|
| ||
| Inverse difference normalized (INN) [ |
| Helps to measure the difference between neighbouring pixel using normalized values. |
|
| ||
| Inverse difference moment normalized (idmnc) [ |
| It computes the difference between the neighbouring intensity values that are normalized by the total number of discrete intensity values. |
Classification of glcm factors based on correlation ranges.
| Correlation relationship | Correlation range | GLCM factors as per correlation |
|---|---|---|
| A (negative correlation) | -1 to 0.09 | [“autocor,” “contr,” “ corrm,” “corrp,” “cprom,” “cshad,” “dissi,” “energ,” “entro,” “homom,” “homop,” “maxpr,” “sosvh,” “savgh,” “svarh,” “senth,” “dvarh,” “denth,” “inf1h,” “inf2h,” “homom.1,” “indnc,” “idmnc”] |
|
| ||
| B (low correlation) | 0 to 0.49 | [“autoc,” “contr,” “corrm,” “corrp,” “cprom,” “cshad,” “dissi,” “entro,” “sosvh,” “savgh,” “svarh,” “senth,” “dvarh,” “denth,” “inf1h,” “inf2h,” “ idmnc”] |
|
| ||
| C (medium correlation) | 0.5 to 0.89 | [“autoc,” “contr,” “cprom,” “cshad,” “dissi,” “energ,” “entro,” “homom,” “homop,” “maxpr,” “sosvh,” “savgh,” “svarh,” “senth,” “dvarh,” “denth,” “inf2h,” “homom1,” “indnc”] |
|
| ||
| D (high correlation) | 0.9 to 1.00 | [“autoc,” “contr,” “corrm,” “corrp,” “cprom,” “cshad,” “dissi,” “energ,” “entro,” “homom,” “homop,” “maxpr,” “sosvh,” “savgh,” “svarh,” “senth,” “dvarh,” “denth,” “homom.1,” “indnc”] |
Agglomeration clustering schedule.
| Stage no. | Cluster combined | Coefficients | Stage cluster first appears | Next stage | ||
|---|---|---|---|---|---|---|
| Cluster 1 | Cluster 2 | Cluster 1 | Cluster 2 | |||
| 1 | 2 | 17 | .000 | 0 | 0 | 14 |
| 2 | 3 | 4 | .000 | 0 | 0 | 20 |
| 3 | 9 | 23 | .000 | 0 | 0 | 6 |
| 4 | 10 | 11 | .367 | 0 | 0 | 10 |
| 5 | 8 | 12 | .452 | 0 | 0 | 10 |
| 6 | 9 | 16 | .849 | 3 | 0 | 9 |
| 7 | 1 | 15 | 1.358 | 0 | 0 | 8 |
| 8 | 1 | 13 | 1.493 | 7 | 0 | 12 |
| 9 | 9 | 18 | 1.820 | 6 | 0 | 17 |
| 10 | 8 | 10 | 3.854 | 5 | 4 | 16 |
| 11 | 5 | 6 | 9.366 | 0 | 0 | 15 |
| 12 | 1 | 14 | 9.856 | 8 | 0 | 15 |
| 13 | 21 | 22 | 24.732 | 0 | 0 | 16 |
| 14 | 2 | 7 | 27.952 | 1 | 0 | 18 |
| 15 | 1 | 5 | 32.500 | 12 | 11 | 17 |
| 16 | 8 | 21 | 58.100 | 10 | 13 | 21 |
| 17 | 1 | 9 | 62.709 | 15 | 9 | 18 |
| 18 | 1 | 2 | 90.114 | 17 | 14 | 19 |
| 19 | 1 | 20 | 257.481 | 18 | 0 | 20 |
| 20 | 1 | 3 | 443.493 | 19 | 2 | 22 |
| 21 | 8 | 19 | 2099.203 | 16 | 0 | 22 |
| 22 | 1 | 8 | 2152.905 | 20 | 21 | 0 |
Comparative cluster membership analysis with different linkages.
| S. no. | Indicator/variables | Between linkage | Centroid linkage | Furthest neighbour linkage | Median linkage | Nearest neighbour linkage | Ward's linkage | Within linkage |
|---|---|---|---|---|---|---|---|---|
| 1 | Autocorrelation (autoc) | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| 2 | Contrast (contr) | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| 3 | Correlation (corrm) | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| 4 | Correlation (corrp) | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| 5 | Cluster prominence (cprom) | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| 6 | Cluster shade (cshad) | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| 7 | Dissimilarity (dissi) | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| 8 | Energy (energ) | 2 | 2 | 2 | 2 | 2 | 2 | 2 |
| 9 | Entropy (entro) | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| 10 | Homogeneity (homom) | 2 | 2 | 2 | 2 | 2 | 2 | 2 |
| 11 | Homogeneity (homop) | 2 | 2 | 2 | 2 | 2 | 2 | 2 |
| 12 | Maximum probability (maxpr) | 2 | 2 | 2 | 2 | 2 | 2 | 2 |
| 13 | Sum of squares: variance (sosvh) | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| 14 | Sum average (savgh) | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| 15 | Sum variance (svarh) | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| 16 | Sum entropy (senth) | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| 17 | Difference variance (dvarh) | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| 18 | Difference entropy (denth) | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| 19 | Information measure of correlation1 (inf1h) | 3 | 3 | 3 | 3 | 3 | 3 | 3 |
| 20 | Information measure of correlation2 (inf2h) | 3 | 3 | 3 | 3 | 3 | 3 | 3 |
| 21 | Inverse difference (INV) is homom (homom1) | 2 | 2 | 2 | 2 | 2 | 2 | 2 |
| 22 | Inverse difference normalized (INN) (indnc) | 2 | 2 | 2 | 2 | 2 | 2 | 2 |
| 23 | Inverse difference moment normalized (idmnc) | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
Figure 11Visualization of clusters formed.