| Literature DB >> 29387357 |
Mohammad Reza Amiryousefi1, Mohebbat Mohebbi2, Ali Tehranifar3.
Abstract
Application of new procedures for reliable and fast recognition and classification of seeds in the agricultural industry is very important. Recent advances in computer image analysis made applicable the approach of automated quantitative analysis in order to group cultivars according to minor differences in seed traits that would be indiscernible in ocular inspection. In this work, in order to cluster 20 cultivars of pomegranate seed, nine image features and 21 physicochemical properties of them were extracted. The aim of this study was to evaluate if the information extracted from image of pomegranate seeds could be used instead of time-consuming and partly expensive experiments of measuring their physicochemical properties. After data reduction with principal component analysis (PCA), different kinds of overlapping between these two types of data were controlled. The results showed that clustering base on all variables of image features contain more similar cultivars with clustering base on physicochemical properties (66.67% for cluster 1, 75% for cluster 2, and 50% for cluster 3). Therefore, by applying image analysis technique, the seeds almost were placed in different pomegranate clusters without spending time and additional costs.Entities:
Keywords: Clustering; Image analysis; PCA; Pomegranate seed
Year: 2017 PMID: 29387357 PMCID: PMC5778205 DOI: 10.1002/fsn3.475
Source DB: PubMed Journal: Food Sci Nutr ISSN: 2048-7177 Impact factor: 2.863
Figure 1Schematic view of color measurement for a seed of MDSiR cultivar
Figure 2Schematic view of preparing images to determine the morphological parameters of VJG cultivar (a=original, b= make binary & threshold (Autso), c=median filter (r = 2 pixel), d=dilation)
Results of the PCA for image features and physicochemical properties
| Principal components | Eigen value | % Variance | Cumulative variance % |
|---|---|---|---|
| Image features | |||
| PC1 | 3.61 | 40.09 | 40.09 |
| PC2 | 2.20 | 24.47 | 64.56 |
| PC3 | 1.69 | 18.78 | 83.34 |
| PC4 | 1.05 | 11.62 | 94.96 |
| Physicochemical properties | |||
| PC1 | 5.91 | 28.12 | 28.12 |
| PC2 | 4.10 | 19.55 | 47.67 |
| PC3 | 2.77 | 13.17 | 60.84 |
| PC4 | 2.25 | 10.70 | 71.54 |
| PC5 | 1.45 | 6.90 | 78.44 |
| PC6 | 1.28 | 6.08 | 84.51 |
Eigenvectors (EV) and correlations (R) between variables and PCs of image features
| Variable | PC1 | PC2 | PC3 | PC4 | ||||
|---|---|---|---|---|---|---|---|---|
| EV | R | EV | R | EV | R | EV | R | |
| 1. Area | −0.28 | −0.53 | 0.42 | 0.62 | 0.00 | 0.01 | −0.53 | −0.54 |
| 2. Perimeter | −0.48 | −0.91 | 0.24 | 0.36 | 0.08 | 0.10 | −0.12 | −0.13 |
| 3. Circularity | 0.50 | 0.95 | −0.04 | −0.07 | −0.10 | −0.13 | −0.22 | −0.23 |
| 4. Roundness | 0.41 | 0.78 | 0.36 | 0.54 | −0.09 | −0.12 | −0.04 | −0.04 |
| 5. Solidity | 0.45 | 0.86 | −0.07 | −0.10 | −0.14 | −0.18 | −0.28 | −0.29 |
| 6. MFD | 0.12 | 0.23 | 0.63 | 0.93 | −0.02 | −0.02 | −0.19 | −0.20 |
| 7. L value | 0.16 | 0.30 | 0.00 | −0.01 | 0.72 | 0.94 | −0.09 | −0.09 |
| 8. a value | 0.03 | 0.06 | 0.39 | 0.58 | −0.37 | −0.49 | 0.60 | 0.61 |
| 9. b value | −0.17 | −0.31 | −0.29 | −0.43 | −0.54 | −0.71 | −0.42 | −0.43 |
Eigenvectors and correlations between variables and PCs of physicochemical properties
| Variable | PC1 | PC2 | PC3 | PC4 | PC5 | PC6 | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| EV | R | EV | R | EV | R | EV | R | EV | R | EV | R | |
| 1. Fruit length | 0.12 | 0.28 | 0.40 | 0.82 | −0.09 | −0.14 | −0.03 | −0.05 | 0.08 | 0.10 | 0.14 | 0.16 |
| 2. Fruit diameter | 0.05 | 0.13 | 0.46 | 0.94 | 0.03 | 0.05 | −0.05 | −0.08 | −0.12 | −0.15 | 0.09 | 0.11 |
| 3. Fruit volume | 0.06 | 0.14 | 0.46 | 0.93 | 0.03 | 0.05 | −0.13 | −0.20 | −0.14 | −0.17 | 0.06 | 0.07 |
| 4. Fruit density | 0.32 | 0.76 | −0.19 | −0.38 | −0.13 | −0.22 | 0.05 | 0.08 | 0.00 | 0.00 | −0.13 | −0.15 |
| 5. Calix length | −0.10 | −0.25 | 0.13 | 0.27 | 0.16 | 0.26 | 0.12 | 0.18 | −0.26 | −0.31 | 0.20 | 0.23 |
| 6. Calix diameter | −0.21 | −0.50 | −0.01 | −0.01 | −0.16 | −0.27 | 0.16 | 0.25 | −0.45 | −0.54 | 0.14 | 0.16 |
| 7. Thickness skin | −0.35 | −0.84 | 0.14 | 0.28 | −0.15 | −0.25 | −0.08 | −0.13 | −0.03 | −0.03 | 0.13 | 0.15 |
| 8. Skin/fruit % | −0.39 | −0.95 | 0.09 | 0.18 | −0.04 | −0.07 | 0.02 | 0.03 | 0.07 | 0.09 | −0.05 | −0.06 |
| 9. Aril/fruit % | 0.40 | 0.96 | −0.10 | −0.20 | 0.03 | 0.06 | −0.05 | −0.07 | −0.02 | −0.03 | 0.03 | 0.03 |
| 10. Seed humidity weight | 0.21 | 0.52 | 0.33 | 0.66 | −0.12 | −0.18 | 0.22 | 0.33 | 0.19 | 0.23 | −0.14 | −0.16 |
| 11. Seed/fruit % | 0.23 | 0.55 | 0.15 | 0.30 | −0.13 | −0.22 | 0.33 | 0.50 | 0.32 | 0.39 | −0.20 | −0.23 |
| 12. Juice volume | 0.33 | 0.81 | 0.12 | 0.24 | 0.07 | 0.12 | −0.24 | −0.37 | −0.24 | −0.29 | 0.12 | 0.13 |
| 13. Juice density | −0.05 | −0.13 | −0.16 | −0.33 | −0.22 | −0.37 | −0.20 | −0.30 | 0.16 | 0.20 | 0.59 | 0.67 |
| 14. Juice fruit/fruit % | 0.34 | 0.82 | −0.19 | −0.38 | 0.08 | 0.13 | −0.17 | −0.25 | −0.18 | −0.21 | 0.16 | 0.18 |
| 15. pH | 0.08 | 0.19 | 0.07 | 0.14 | 0.51 | 0.85 | 0.20 | 0.30 | 0.00 | −0.01 | 0.17 | 0.20 |
| 16. T.S.S | 0.09 | 0.21 | −0.01 | −0.03 | 0.31 | 0.51 | 0.39 | 0.59 | 0.10 | 0.12 | 0.42 | 0.47 |
| 17. TA (mg.100 g) | 0.11 | 0.26 | −0.24 | −0.49 | −0.40 | −0.66 | 0.21 | 0.32 | −0.13 | −0.15 | 0.08 | 0.09 |
| 18. Anthocyanin (mg.100 g) | 0.19 | 0.46 | 0.12 | 0.24 | −0.33 | −0.55 | −0.26 | −0.39 | 0.14 | 0.17 | 0.30 | 0.34 |
| 19. Total phenolics (mg.100 g) | 0.07 | 0.18 | −0.05 | −0.10 | −0.13 | −0.21 | 0.48 | 0.72 | −0.36 | −0.44 | 0.13 | 0.15 |
| 20. Total sugars (mg.100 g) | −0.10 | −0.24 | −0.06 | −0.11 | 0.01 | 0.01 | 0.19 | 0.28 | 0.50 | 0.60 | 0.32 | 0.36 |
| 21. Antioxidant % | 0.03 | 0.07 | 0.19 | 0.39 | −0.42 | −0.69 | 0.25 | 0.38 | −0.07 | −0.08 | 0.00 | 0.00 |
Results of agglomerative hierarchical clustering (AHC) based on different variables
| Clustering base | Variable | Cluster no. | Cultivars | Objects | Within‐class variance | Average distance to centroid |
|---|---|---|---|---|---|---|
| Image features | All variables | 1 | SPS, MPN, SPK, PSD, PSA, DA | 6 | 9.69E + 04 | 2.51E + 02 |
| 2 | MS, VJG, ZY, GSY, MMS, KB, MB, SK, TSF, LPK | 10 | 2.64E + 04 | 1.20E + 02 | ||
| 3 | MPG, MDSiR, MDSR, MPS | 4 | 1.08E + 05 | 2.42E + 02 | ||
| PC scores | 1 | SPS, VJG, MPG, MDSR,PSD, MPS, DA | 7 | 3.02E + 04 | 1.31E + 02 | |
| 2 | MPN, SPK, MDSiR, PSA | 4 | 1.50E + 04 | 9.77E + 01 | ||
| 3 | MS, ZY, GSY, MMS, KB, MB, SK, TSF, LPK | 9 | 2.10E + 04 | 1.01E + 02 | ||
| PC indicators | 1 | SPS, MPG, ZY, SK, TSF | 5 | 6.73E + 00 | 2.20E + 00 | |
| 2 | MPN, VJG, SPK, PSD, GSY, PSA, DA, LPK | 8 | 4.85E + 00 | 1.86E + 00 | ||
| 3 | MS, MDSiR, MDSR, MMS, MPS, KB, MB | 7 | 1.50E + 01 | 3.46E + 00 | ||
| Physicochemical traits | All variables | 1 | SPS, MS, MDSR, PSA, MMS | 5 | 1.20E+07 | 3.02E + 03 |
| 2 | MPN, VJG, SPK, MPG, PSD, ZY, GSY, DA,SK, TSF, LPK | 11 | 5.04E + 06 | 1.88E + 03 | ||
| 3 | MDSiR, MPS, KB, MB | 4 | 6.09E + 06 | 1.85E + 03 | ||
| PC scores | 1 | SPS, MS, KB, TSF, LPK | 5 | 2.42E + 06 | 1.31E + 03 | |
| 2 | MPN, SPK, MPG, MDSiR, MDSR, GSY, PSA, MMS, MPS, MB | 10 | 1.52E + 06 | 1.14E + 03 | ||
| 3 | VJG, PSD, ZY, DA, SK | 5 | 1.82E + 06 | 1.16E + 03 | ||
| PC indicators | 1 | SPS, MPN, SPK, MDSR,GSY, PSA | 6 | 3.06E + 06 | 1.19E + 03 | |
| 2 | MS, VJG, PSD, ZY, DA, SK, TSF, LPK | 8 | 1.11E + 06 | 8.18E + 02 | ||
| 3 | MPG, MDSiR, MMS, MPS, KB, MB | 6 | 2.09E + 06 | 9.84E + 02 |
The percentage of overlapping for image‐based clustering and physicochemical‐based clustering
| Image‐based clustering | Physicochemical‐based clustering | Cluster 1 | Cluster 2 | Cluster 3 |
|---|---|---|---|---|
| PC indicators | All variables | 20% | 63.64% | 100% |
| PC indicators | PC scores | 40% | 40% | 0% |
| PC indicators | PC indicators | 16.67% | 50% | 83.33% |
| PC scores | All variables | 40% | 18.18% | 50% |
| PC scores | PC scores | 20% | 40% | 40% |
| PC scores | PC indicators | 33.33% | 0% | 50% |
| All variables | All variables | 40% | 54.55% | 50% |
| All variables | PC scores | 20% | 30% | 0% |
| All variables | PC indicators |
|
|
|
Figure 3Dendrogram from hierarchical clustering of the PC indicators of physicochemical properties which groups 20 pomegranate seed cultivars
Figure 4Dendrogram from hierarchical clustering of all variables of image features which groups 20 pomegranate seed cultivars