Literature DB >> 36238676

A Performance Comparison of Classification Algorithms for Rose Plants.

Muzamil Malik1, Waqar Aslam1, Emad Abouel Nasr2, Zahid Aslam1, Seifedine Kadry3.   

Abstract

One of the key roles of Botanists is to be able to recognize flowers. This role has become highly challenging given that the number of discovered flower types are nearing half a million. To support Botanists, Information Technology offers promising solutions. Specifically, machine learning techniques are intrinsically appealing due to being precise enough as required. To this aim, two observations on flower leaves are relevant and leverage flower identification: one, flower plants exhibit unique features in their leaves thus allow distinction of their co-located flowers; two, leaves have a much longer life than flowers thus preserve identity properties longer. This paper proposes the use of machine learning-based identification of rose types by leveraging the features from their leaves. For this purpose, the performance of Naive Bayes, Generalized Linear Model, Multilayer Perceptron, Decision Tree, Random Forest, Gradient Boosted Trees, and Support Vector Machine has been analyzed. This study optimizes the RF model by investigating and tuning its various parameters such as the number of trees, the depth of trees, and splitting criteria. The best results are achieved with gain ratio because it takes more distinct values to avoid the problems associated with Information Gain. Optimizing the number of trees and the depth of trees of RF yield better accuracy than other models. Extensive experiments are performed to analyze the results of ensemble algorithms by using the voting method for each instance. Results suggest that the performance of ensemble classifiers is superior to that of individual models.
Copyright © 2022 Muzamil Malik et al.

Entities:  

Mesh:

Year:  2022        PMID: 36238676      PMCID: PMC9552688          DOI: 10.1155/2022/1842547

Source DB:  PubMed          Journal:  Comput Intell Neurosci


1. Introduction

Pakistan has an agriculture-based economy in which the horticulture profession is common. For agriculture implants, most traditional resources are used and the population is massively growing due to which national production requirements are hardly met. The cost-benefit ratio in the agriculture industry is suboptimal and requires the adoption of new technologies and automated processes. To this end, one interesting area of automation is image processing for effective usage in horticulture. Machine learning in image processing has met great success to solve real-world problems such as detection and classification of cancerous tissues, face recognition, crop/plant classification, and image-based searching [1]. Plant classification has been a very important research area for many decades. So far about 250,000 kinds of flowering plants have been identified and classified [2]. Researchers have been trying to make the classification of fruit, vegetable, and flowering plants an easy process with lesser manual involvement. Amongst flowering plants, rose plants have universal appeal due to their matchless beauty. They have economic value due to demand around the globe as being used to prepare medicines, cosmetics, perfumes, oils, etc. [3]. The Netherlands is home to the largest rose farms in the world [4]. It is increasingly becoming relevant and significant to keep track of not only existing rose species but also to identify new ones. Rose plants vary in their morphological characteristics, which may affect their leaves, flowers or even the entire plants [5]. Rose leaves contain key knowledge and survive longer than the roses. Identifying rose plants or in general flowering plants through their leaves is a troublesome task for plant scientists if done manually. It requires appropriate training, time, and manpower to perform this task, especially if done at a larger scale. Given that roses have about 150 species that vary in colours, sizes, and fragrance, their manual identification is still tedious and time-consuming. There is a need for an efficient approach, adopting which this task can be performed on large scale using the available automation technologies. Thus, the main objective of this work is to identify rose types automatically. With the high availability of smart mobile phones, the idea is to develop an expert application that can classify roses, thus effectively eliminating the involvement of plant scientists. This application can use the built-in phone camera to capture rose leave images for roses classification [5]. To this end, our preliminary work is based on the k-nearest neighbour (k-NN) algorithm [6]. Random forest (RF) is one of the widely used machine learning models for classification tasks that uses “wisdom of the crowd” to make the final prediction. RF is a good choice when it comes to the problem of high dimensional and imbalanced data [7, 8]. The accuracy is better than other machine learning models as it uses the mean or average of many decision trees for the final decision. Currently, it is being employed in many domains like health care, prediction for time series data and agriculture, etc. [9, 10]. In general, this study makes the following contributions: A methodology is designed to perform automatic rose classification using rose leaves. For this purpose, the image processing approach is followed. Two sets of features are tested for this task including histogram and texture features. Four subsets of features are evaluated through extensive experiments. The performance of Naive Bayes (NB), Generalized Linear Model (GLM), Deep Learning (DL) Multinomial Model, Decision Tree (DT), RF, Gradient Boosted Trees (GBT), and Support Vector Machine (SVM) is analyzed in detail. For experiments, a new dataset of rose leaves is collected. Ensemble classifiers are tested for the classification task using various combinations of selected machine learning models with four subsets of features to analyze the classification accuracy. This study especially focuses on the performance of RF which shows better results than other models for rose plant classification. Due to good results of RF, its performance is further improved by analysing the influence of various selection criteria such as Information Gain, Gini Index, etc. The rest of the paper is organized as follows. Section 2 discusses research papers from the literature which are closely related to the current study. Section 3 gives an overview of the machine learning algorithms adopted for the current research, the description of the dataset used for the experiment, as well as, the proposed approach. Results are discussed in Section 4. In the end, the conclusion is given in Section 5.

2. Literature Review

Classification of plants carries multiple purposes such as to name plants, extract useful information, study features that impact yield of fruits/vegetables and quality, and predict their price. Some representative classification approaches are presented next.

2.1. Machine Learning Models

Machine learning offers reliable algorithms for predictability [11, 12]. For predicting prices of various varieties of fruits, they have been classified using a hybrid method based on texture, histogram and colour features [13]. The proposed algorithm FSCABC–FNN obtained 89.1% classification accuracy. Tomatoes have been graded for readiness using colour traits in [14]. Principal components analysis and SVM are used for feature extraction and linear discriminant for categorization. Results show 90.80% accuracy. Quality assessment and disease detection of sunflowers using texture and colour traits obtained from leaves has been studied using multi-class SVM, k-NN, Multinomial Logistic Regression, and NB in [15]. Another work suggests the importance of structural cues for flower identification [16]. The feature vector is built and input to the proposed method. The accuracy is increased from 76.9% to 82.6%. A performance comparison for classification of plants using computer vision is presented in a survey [17]. Plant organs, information on different features namely vein structure, colour, shape, margin, and texture are studied. Texture features in combination with leaf traits are found to be the best for identification. The authors build a vision-based leaf identification system in [18]. The study uses different features including the shape, inner structure, colour, surface For this purpose. Similarly, [19] designs a mobile-based leaf identification system that first determines the leaf and non-leaf samples and then classifies the leaves. Curvature-based shape features are used in this regard. Experimental results show the efficiency and robustness of the proposed system. Rose plant classification is carried out in [6] that deploys k-NN based approach to this end. Using a different number of neighbours for k-NN, experiments are performed with histogram and texture features. The obtained accuracy for histogram and texture features are 65.00% and 45.50%, respectively. Similarly, [20] endeavours to classify eight types of flowers using scale-invariant feature transform (SIFT) features. SVM and RF are applied for the classification of the features from segmented images. Rather a short dataset containing 215 images is used for classification where high accuracy is achieved when flowers of dissimilar shapes are classified. The authors use pre-trained VGG16 architecture to rose flower disease classification in [21]. Early and late fusion techniques are applied combining VGG16 and SVM where the early fusion models show better results with 88.33% accuracy. The study [22] provides a comprehensive review of machine learning models that are recently adopted for species classification. The study especially covers the vision-based approaches applied for flower classification and discusses the famous pre-trained models. An AI-based guava disease prediction system is presented in [23] that utilizes the high-quality images of guava leaves. The efficiency of several machine learning classifiers is evaluated like k-NN, complex tree, boosted tree, bagged tree, and SVM. Additionally, the use of histogram and textual features prove to show higher accuracy.

2.2. Use of Deep Learning Models

Predictability through deep learning promises improvement over machine learning algorithms, as demonstrated in various dynamic problem areas such as cloud computing [24]. In the context of this work, the quality and defects of the Jasmine flower have been identified with an 83% efficiency using texture, colour, and shape traits [25]. Plants have been identified using leaf features with images taken directly from plants [26]. The used method is based on a convolutional neural network (CNN) and features based on a deconvolutional network. It is found that shape features are inadequate for identification due to less discriminatory information contained in the leaves. On the contrary, venation structure and leaf shape features give better results. Similarly, species classified using deep learning is found a promising approach [27]. A non-scalable manual approach is proposed in which visual characteristics have been selected from flower images and generalized to predict new unknown flowers [28]. The proposed method proved to be more effective than traditional approaches. There have been efforts to understand roses in detail and to recognize their variety [29]. The work is based on Fourier Transformation that considers descriptor angles of roses to recognize their shapes like round, irregular round, or star. The obtained efficiency is higher than other contemporary methods. Similarly, a system of neural network-based classification has investigated blooming flowers [30]. A detailed analysis of different machine learning classifiers like NB, DT, simple k-Means, MLP, SVM, and RF using the WEKA tool is presented in [31]. The main objective is to find the best classification algorithm to enhance the accuracy of classification with reduced processing time. Many evaluation parameters are used for analysing results such as mean absolute error, root means squared error, TP-rate, TN-rate, FP-rate, FN-rate, precision, and recall. Results indicate NB is the best choice for improving traditional classification problems. SVM gives the best average accuracy. Plant classification using a CNN is performed in [32] which used the BJFU100 dataset containing 100 species of iris plant. Images are added to the dataset using a mobile device application to collect more images. The residual network is introduced which removes the vanishing gradient and degradation problems. The proposed network is 26 layers in-depth and allows the input flow to deeper layers without losing information. The parameters of RestNet26 are well trained that they can learn the discriminative features and avoid under fitting. The proposed approach can achieve an accuracy of 91.78% which is better than the existing RestNet with 18, 34, and 50 layers. Another deep learning approach called a Fully Convolutional Network (FCN) is proposed in [33] for plant classification. It performs automatic segmentation of flowers from the background. It used the VGG-16 model for initialization. FCN has several convolutional layers and 3 deconvolutional layers. FCN is trained until the validation accuracy starts decreasing and training is restarted from the last learned model. The objective is to let the model learn local features in the first two blocks. By this process they collected the segmented flower images and kept only those that had high discriminative region, other images are discarded. A CNN is proposed to be trained on FCN. Evaluation metrics are proposed for measuring the accuracies of the segmentation, detection, and classification methods. Results show the accuracy of 99.0%, 98.5%, and 97.1% on Zou-Nagy, Oxford 17, and Oxford 102, respectively. Along the same lines, [34] presents the use of multiple deep learning models and combines auto encoders and CNN for plant leave classification. The auto encoder and CNN are used for feature extraction which is later used to train an SVM for classification which yields better results than traditional machine learning models. The authors utilize low-quality images in [35] with deep learning models to improve the performance for plant disease prediction. The study utilizes Chebyshev orthogonal functions and probability distribution functions regarding the colour histograms. Experiments performed using the MobileNetV2 show better performance over traditional methods. Similarly, a mask residual CNN (RCNN) based approach is presented in [36] recognizing to detect diseases from apple leaves. Experiments using the Plant Village dataset yield a 96.6% accuracy using the ensemble subspace discriminant analysis. In a similar fashion, a residual NN (RNN) is used by authors in [37] for detecting cassava mosaic disease. A modified deep RNN is designed for disease detection with a balanced dataset using block processing. With a balanced dataset high accuracy is reported with deep RNN showing 9.25% better performance than a traditional CNN.

2.3. Use of Selective Features for Classification

A flower classification technique is introduced in [38] which uses selective discriminative features for 103 class datasets. Images are downloaded from the web which varies in scale, resolution, clutter, lighting, quality, etc. An automatic segmentation scheme introduced by Nilsback and Zisserman is used. Colour, histogram of gradient (HOG) features are used using SIFT for the foreground region. SVM is used as a classifier where each kernel represents each feature. Classification results are much better with combined features within the kernel framework, which improves efficiency. Study [39] described different perspectives of image acquisition and its impact on classification accuracy. Three image factors are considered: perspectives, illumination, and background. CNN is used for feature extraction and SVM for classification. Total 27 datasets are formed using nine image types (backlight on/off, plain background/natural background, top view, and back view of the leaf) and three pre-processing strategies are used pre-processed, cropped, and segmented. The highest accuracy of 91% on cropped backlight images and 55% lowest accuracy is achieved on non-pre-processed images. It is found that cropping is more effective than segmenting, backside images do not contribute to achieving higher accuracy but need more human efforts in image acquisition. The non-destructive way is to take topside images and crop them from leave boundaries of herbaceous leaves. If the destructive way is permissible plucking the leaf using backlight yields higher accuracy. Spatio-temporal features have also been utilized with deep learning models for prediction and classification as in [40] for crow flows prediction. Similarly, the Spatio-temporal features are used with hybrid deep learning models in [41]. An attention-based network is designed in [42] that makes use of Spatio-temporal features for traffic flow prediction. The authors developed an image capturing scheme in [43] for obtaining the best perspectives that contribute to the classification accuracy of flowering plants. The images of a single plant are taken considering five different perspectives: entire plant, flower frontal, flower lateral, leaf top, and leaf back. A large dataset comprising 101 species of plant families is assembled. Images are taken during the flowering season. A CNN is trained on the collected dataset shows that CNNs can learn the discriminative features directly from raw pixels. Transfer learning is used for training while the performance is evaluated using a simple sum rule that combines the scores of different perspectives. An accuracy of 77.4% is achieved for the entire plant, 88.2% for flower frontal, and the best results are achieved by fusing all five perspectives giving an accuracy of 97.1%. It is concluded that the species that are difficult to recognize even by humans can be recognized by multi-organ identification. Because of the lack of a universal perspective for all species, different organ views of the plant are beneficial for identifying the important perspectives of plants.

3. Methodology

In this section, we discuss the proposed methodology. The steps followed in the proposed methodology are shown in Figure 1 and briefly described in the preceding sections.
Figure 1

Steps followed in the proposed methodology. Starting with data collection, study follows data processing before feature extraction. In the end, classification is performed.

3.1. Data Collection

Machine learning algorithms learn on the available data or evidence. Mistakes in data collection are easily propagated to the training phase and affect the performance of classifiers. Thus, we have collected data carefully using a 23MP camera capturing orange, red, pink, and white rose leaves (please see Figure 2). Images are taken in a controlled environment keeping the light condition the same for all images. Each image comes from a different plant. The images are captured in a controlled environment where the lighting conditions are almost similar for all the captures. The dataset consists of 10 classes. The resolution of the captured image is 1080 × 1920 pixels. The data is split into 0.6 to 0.4 ratios for training and testing, respectively.
Figure 2

Leaves of different colored roses, (a) Orange rose, (b) Pink rose, (c) Red rose, and (d) White rose. The leave images are from the dataset collected for experiments in this study.

3.2. Data PreProcessing

We convert colour images to grey level images and mark two regions of interest (ROIs) on each of them using the CVIP tool [44]. Thus, dataset of size 200 is formed. Conversion into grey level aims at reducing the unnecessary information from the images and computational processing. Pre-processing allows feature enhancement and should be carried out carefully to avoid losing vital information that can lead to wrong identification.

3.3. Feature Extraction

Feature selection has a direct impact on the classification process. Feature space can easily grow enormously, hence extracting a minimal set of features is desirable though it is computationally intensive. Too much extraction of features may easily compromise generalization of results. Figures 3 and 4 show the examples of natural and artificial textures.
Figure 3

Examples of artificial textures.

Figure 4

Examples of natural textures.

Table 1 shows the histogram and texture features used in this study. Each feature is defined next.
Table 1

The list of features used for different classifier.

Histogram featuresTexture features
MeanEnergy
Standard deviationInertia
SkewCorrelation
EnergyInverse difference
EntropyEntropy

3.3.1. Mean

Histogram mean describes the average level of intensity of the image or texture being examined [45]. Mathematically it is given as,where p(i) is the fraction of samples in class i and G is the number of grey levels used [6].

3.3.2. Skew

Skewness provides the data with distribution, whether or not the resulting distribution is symmetric, positively skewed, or negatively skewed [31]. It is given as [6],

3.3.3. Energy

Energy feature measures the contrast between a pixel and its surrounding pixels [32]. It gives a large value if the image is homogeneous. Homogeneous means there are a large number of pixels that have similar intensity values. If this feature gives a positive 1, it means the image is constant. It is given as,where i, j are the spatial coordinates of the function p(i, j) [6].

3.3.4. Entropy

It varies inversely with the energy, while it is defined as the number of bits needed to code the data [46]. It is given as,where i and j are the fractions of examples in class i and j, respectively [6].

3.3.5. Inertia

Inertia is an image moment and shows a weighted average in terms of intensity of image pixels. It is calculated usingwhere i, j are the spatial coordinates of the function p(i, j) [6].

3.3.6. Correlation

It is the relationship between two values. The coefficient of correlation lies between 1 and −1. A value near 1 means there is a positive correlation between nearest pixel values, while a value closer to -1 means there is a negative correlation between them [32, 46]. It is given aswhere μ and μ are the means and σ and σ are the standard deviations of p and p, the partial probability functions [6].

3.3.7. Inverse Difference

It is the local homogeneity that is high when the local grey level is uniform [46, 47]. It is given aswhere p(i, j) is the probability that a pixel with value i will be found adjacent to a pixel of value j [6]. Selection of histogram and texture features is based on their success rate in similar classification problems [48-51]. The contribution of histogram due to its brightness and contrast aspects is proven [30]. For classification through leaves texture feature has a vital role. Texture not only considers leaf venation structure but also gives the directional characteristics of pixels selected from the leaf. It is independent of leaf colours and shape. Texture analysis is made from a group of pixels. It is considered more dominant a feature than the shape feature [5, 17]. CVIP is a famous library used for simple to complex image processing tasks like image reading, transforming, and region of interest (ROI) capturing [52]. It also provides automated tools to control the quality of images and image enhancement. It is used with a graphical user interface (GUI) based software tools like LabView where numerical and statistical analysis can also be performed.

3.4. Deep Learning Models

In addition to machine learning models, this study implements deep learning models for rose plant classification.

3.4.1. Convolutional Neural Network

CNN is a widely used deep learning model for image processing tasks [1]. Good results can be obtained by CNN as it can efficiently handle data complexity and pre-processing. It includes a convolutional layer to learn complex features from the input data while max-pooling is used as the pooling layer in this study. The convolution layer is used with rectified linear unit (ReLU) activation while the kernel size is 3 × 3. Max-pooling is used with 2 × 2. It is followed by a flatten layer and 0.2 dropout layer to reduce the probability of model over fitting. A dense layer is used with 512 neurons.

3.4.2. Long Short-Term Memory Network

This study also uses the LSTM model for rose plant classification. LSTM has four gates, each for a different task. LSTM has a feedback mechanism and produces good results for classification tasks [53]. LSTM is used for an embedding layer with dimensions of 5000 and 100. It is followed by a dropout layer with a 0.5 dropout rate. Then an LSTM layer is added with 100 units. In the end, a dense layer is added with a “softmax” activation layer to get the output for the desired number of classes.

3.4.3. RestNet

The RestNet also called residual network is a pre-trained model and is among the commonly used pre-trained models for tasks related to image processing. RestNet aims at providing high accuracy for complex image processing tasks [54]. It has different layers where each layer has a different structure with respect to convolutional size and filters. Possessing a deep structure, RestNet can learn better by going deeper during the training phase and ultimately provides better results than traditional deep learning models [55].

4. Results and Discussion

We used CVIP tool to extract features and RapidMiner [56] for classification. Small result subsets of texture and histogram related features are listed in Tables 2 and 3, respectively. The rest of the results are not listed for brevity.
Table 2

Few sample results of histogram features.

LabelsROIMeanStandard deviationSkewEnergyEntropy
Orange129.4883.5310.4640.0843.790
228.5684.1000.2220.0684.023
332.1477.2960.1600.0404.837

Red131.2492.0790.0530.1403.075
222.4220.9990.3410.2952.013
328.9892.3550.2460.1213.237

White138.6963.0341.5870.1333.334
254.2092.4940.2620.1163.347
364.6743.3400.3970.0863.757

Pink178.3372.4190.4440.1213.271
273.4672.5790.9030.1233.288
363.0282.6860.2730.1063.451
Table 3

Few sample results of texture features.

LabelsROIEnergyInertiaCorrelationInverse differenceEntropy
Orange10.01034.2950.2480.5320.543
20.00214.5190.3060.4340.400
30.0018.9400.2410.4930.244

Red10.0410.8940.2380.4320.351
20.0092.9340.2680.5090.271
30.0131.6570.2340.5160.299

White10.0027.5700.2230.4990.266
20.00116.1150.2480.4250.175
30.0027.3040.2110.4960.307

Pink10.00114.2720.2300.4290.115
20.0018.7120.3640.4680.124
30.00135.0200.4910.3400.317

4.1. Formation of Feature Sets

In this work, results were obtained by using seven different classifiers as described above. We made four feature sets by using the auto model of the RapidMiner tool. These feature sets are made by selecting the features from the set of Histogram and Texture features which are extracted from the images of rose leaves. Results obtained by all classifiers on these feature sets are different for a different set of features. Different sets of features are described below.

4.1.1. Feature Set 1

All extracted features of histogram and texture are used to make feature set 1 (FS1). Histogram mean, histogram standard deviation, histogram energy, histogram skew, histogram entropy, texture energy average, texture energy range, inertia average, inertia range, correlation average, correlation range, inverse difference range, inverse difference average, and texture entropy average.

4.1.2. Feature Set 2

Features having a high correlation with the label column are selected from both histogram and texture features to make feature set 2 (FS2). Histogram mean, histogram standard deviation, histogram energy, histogram entropy, texture energy average, texture energy range, inertia average, inertia range, inverse difference range, and texture entropy average.

4.1.3. Feature Set 3

Feature set 3 (FS3) contains only histogram features with a high correlation to the label column. Histogram mean, histogram standard deviation, histogram energy, and histogram entropy.

4.1.4. Feature Set 4

Set of all texture features with high correlation to labels are used to make feature set 4 (FS4). Texture energy average, texture energy range, inertia average, inertia range, inverse difference range, and texture entropy average.

4.2. Error Metrics

To correctly evaluate the performance of classifiers confusion matrices are used which have four different values. True positive (TP) show the number of those instances that are classified correctly, true negative (TN) represent instances that are true but classified incorrectly, false negative (FN) represent those instances that are false and classified as false while false positive (FP) represents the number of instances that are false but classified as true. Based on these values precision and recall can be evaluated [13]. Recall: This term represents the probability that how many positive classes are recalled by our classifier. The term is defined by Precision: This term represents the probability that how many true positives were found by our classifier. The term is defined by In the end, the classification error is used which represents the percentage of incorrect class predictions.

4.3. k-Nearest Neighbour

In our previous work we used CVIP for k-NN to obtain results on histogram and texture features, the obtained accuracy is 65% and 45.50%, respectively [6]. We formed two feature sets including histogram features and texture features. Figure 5 shows the result of k-NN on different values of k.
Figure 5

Performance of k-NN on texture and histogram features.

4.4. Parameter Settings of Machine Learning Models

We tuned some general parameter settings of machine learning models in RapidMiner and the rest of the parameters are used by their default values. The split operator in the process model makes partitions of data as training set and testing set. It takes 60% examples for training and 40% for testing. Another parameter is sampling types which are automatic, linear, shuffled and stratified. For nominal data types, we used stratified sampling. The parameter settings of NB have the Laplace correction parameter which avoids the conditional probability set to zero and also avoids misleading results. It is a kind of Boolean operator with a default value that is true. GLM has a few parameter settings: family, solver, the maximum number of threads, and regularization. The family parameter has different types including Gaussian, Binomial, Poisson, Gamma, Multinomial, Tweedie, and Auto. Gaussian is used for numeric data (real or integer), Binomial for binomial data, Multinomial for polynomial data more than two classes, Poisson is used for numeric and non-negative data, Gamma is applied for continuous, numeric, and positive data, Tweedie is used for numeric, continuous and non-negative and Auto option selects multinomial for polynomial, binomial for binomial and Gaussian for numeric data. Solver parameter includes IRLSM, L_BFGS, Coordinate_Descent_Naive and Coordinate_Descent. IRLSM is useful for the problems which have a small predictor size, L_BFGS better works on the dataset with many columns, Coordinate_Descent_Naive and Coordinate_Descent are the updated versions of IRLSM. The maximum number of threads is used to make the model reproducible and its default value is 4. Regularization uses the lambda and alpha parameters for controlling the regularization and distribution respectively. Standardize is used for numeric columns to have zero mean and unit variance. A complete list of all parameters used for the machine learning models is provided in Table 4.
Table 4

The parameters used for the machine learning models.

Machine learning modelsTuning parametersValues used
NBLaplace correctionYes

GLM Family (Gaussian, binomial, Poisson, gamma, tweedie, auto)Auto
Solver (IRLSM, L_BFGS, Coordinate_Descent, C_D_Naive)Auto
No. of threads1
RegularizationLambda, alpha

DL Activation function (tanh, rectifier, maxout, ExpRectifier)Rectifier
Hidden layer size50
Epochs10
Adaptive learningEpsilon, rho
StandardizeLoss function, distribution function

DT Criteria (Gain_Ratio, Information_Gain, gini index, accuracy, Least_Square)Gain_Ratio
Maximal depth7
PruningYes
PrepruningYes

RFNo. of trees100
Maximal depth7
CriterionGain_Ratio

GBTNo. of trees60
No. of threads1
Maximal depth4
Learning rate0.1
Sample rate1.0
DistributionAuto

SVM SVM type (C_SVC, nu_SVC, one_class, epsilon_SVR, nu_SVR)Rectifier
Kernel typerbf
C1000.0
Cache size80
Epsilon0.001
Parameters for DL multinomial model are activation function, hidden layer sizes, epochs, adaptive learning and standardization. The activation function is used by neurons in the hidden layers to normalize the output. The activation function is of types Tanh, Rectifier, Maxout and ExpRectifier. The hidden layer size parameter is used to change the size of hidden layers. Epochs are used for iterating the dataset. Adaptive learning is used to avoid the slow convergence by combining learning rate and momentum training. For this purpose, it uses epsilon and rho. Standardize is used for regularization, it has different parameters; L1, L2, max w2, loss function, and distribution function. The loss function is of types Automatic, Quadratic, CrossEntropy, Huber, Absolute, and Quantile. The distribution parameter has sub-parameters of type Auto, Bernoulli, Gaussian, Poisson, Gamma, Tweedie, Quantile, and Laplace. DT parameters include criterion, maximal depth, pruning, and pre-pruning. Criterion includes Gain Ratio Information Gain, Gini Index, Accuracy, and Least Square. Maximal depth is used for varying the size of the tree according to example set. If the pruning parameter is checked it will replace some branches with leaves according to the confidence parameter and pre-pruning specifies the stopping criteria for the generation of the tree. Random Forest parameters are the same as DT except for the additional parameter number of trees. GBT parameters are the number of trees, maximum number of threads, maximal depth, learning rate, sample rate, and distribution. The distribution parameter has the same types that were defined for Deep Learning. SVM parameters include SVM type, kernel type, gamma, C, cache size, epsilon, shrinking, and confidence with multi-class. SVM type is used to select the type of SVM which are C_SVC, nu_SVC, one_class, epsilon_SVR, and nu_SVR, first two types are used for classification tasks, epsilon-SVR and nu-SVR are used for regression, and distribution estimation one-class SVM is used. Kernel type parameter includes linear, poly, RBF, sigmoid, and pre-computed. RBF kernel is the default type; it maps the samples into high dimensional space using a nonlinear function. Gamma is used with poly, sigmoid, and RBF kernels and play important role in changing the accuracy of the model. C specifies the cost parameter and it is used with SVM types C_SVC, epsilon_SVR, and nu_SVR. Epsilon is used for termination criteria. Tables 5–8 show precision, recall, classification error and accuracy of all classifiers on feature sets FS1, FS2, FS3 and FS4.
Table 5

Precision, recall, and classification error on FS1.

Classifiers Precision (%)Recall (%)ClassificationError (%)Accuracy (%)
NB54406040
GLM67643565
DL71683169
DT49465248
RF60603862
GBT66673268
SVM71712872
Table 6

Precision, recall, and classification error on FS2.

Classifiers Precision (%)Recall (%)Classification error (%)Accuracy (%)
NB49415842
GLM60594060
DL74693169
DT62594060
RF68673169
GBT62623664
SVM67673268
Table 7

Precision, recall, and classification error on FS3.

Classifiers Precision (%)Recall (%)Classification error (%)Accuracy (%)
NB63455545
GLM59643565
DL75663466
DT64554456
RF69673268
GBT70683169
SVM70643466
Table 8

Precision, recall, and classification error on FS4.

ClassifiersPrecision (%)Recall (%)Classification error (%)Accuracy (%)
NB35386040
GLM45405842
DL38445545
DT28405941
RF34415743
GBT38435545
SVM35346634
In Figure 6 it is shown that precision, recall, and accuracy of NB and DT are lower than other classifiers. The higher the accuracy, the lower will be the classification error. SVM obtained the highest accuracy of 72% on FS1 as compared to other classifiers.
Figure 6

Results of Machine Learning classifiers when trained using the FS1.

Accuracy depends on both values of precision and recall. Figure 7 shows the comparison of all classifiers. Here RF, DL, and SVM achieved equal accuracies on FS2. DT showed improvement on this data set by obtaining an accuracy of 60% which was 48% for FS1.
Figure 7

Results of Machine Learning classifiers when trained on FS2.

Classification results by all classifiers are depicted in Table 7 on FS3. This feature set only contains histogram features that have a high correlation with the labeled column. NB showed improvement on FS3 compared to FS1, FS2, and FS4. Figure 8 shows that the precision, recall, and accuracy of all classifiers are the same on FS3.
Figure 8

Results of Machine Learning classifiers when trained with FS3.

Classification results on FS4 are less than other feature sets. All classifiers achieved low accuracy on FS4 because this feature set only contains the texture features with a high correlation to the label column. Texture features alone are not good features for classification tasks when there is a small dataset because it works on patterns. Precision, recall, classification, and accuracy on FS4 are shown in Figure 9. It is shown that the accuracy lies between 34% and 45% which is very low than other feature sets.
Figure 9

Results of Machine Learning classifiers when trained with FS4.

Table 9 shows the accuracy of all classifiers on four feature sets. SVM gave the highest accuracy of 72% on FS1 and lowest accuracy of 34% on FS4.
Table 9

Summary of classifiers accuracy (%) using different feature sets.

Feature setsNBGLMDLDTRFGBTSVM
FS140606948626472
FS242656960696868
FS345656656686966
FS440424541434534

4.5. Ensemble Learning

Ensemble learning is the technique of integrating multiple learners to solve the same problem. It builds multiple sets of hypotheses from training data and uses them together. The generalization ability of the ensemble is greater than the individual learners. It provides a more robust solution where the dataset does not contain equal distribution. The first step is to generate base learners either in a parallel manner or sequential and the second step is to combine all base learners for producing multiple combinations, majority voting and weighted averaging are the popular combination schemes for classification and regression respectively. The accuracy of the ensemble methods can be estimated by cross-validation, hold-out test, etc. For the construction of a useful ensemble method, it needs to consider multiple measures, e.g., subsampling the training examples, manipulating the features, manipulating the output and random integration of algorithms. Bagging, boosting, and voting are the common techniques for combining the algorithms [57]. In our study, we used the voting method to combine the algorithms. First, we carried out experiments for each algorithm, and then their results were combined for each instance of the test data. The instance with the highest votes of a class label has given that label. We used NB, GLM, DT, DL, RF, GBT, and SVM for producing ensemble results. We made eleven combination sets by integrating algorithms. The first set contains all seven algorithms, the second set five algorithms (GLM, DL, RF, GBT and SVM) that achieved higher accuracies and the third set nine random combinations of three algorithms from the second set. Results of these ensemble algorithms on our four feature sets are given in Table 10.
Table 10

Summary of ensemble classifiers accuracy (%) on different feature set.

Ensemble modelsAccuracy (%)
FS1FS2FS3FS4
NB + GLM + DT + DL + RF + GBT + SVM66606339
GLM + DL + RF + GBT + SVM70676735
GLM + DL + RF65656643
GLM + DL + GBT62606845
GLM + DL + SVM65656540
GLM + RF + GBT66666840
GLM + RF + SVM66676736
GLM + GBT + SVM68636735
DL + RF + GBT66676741
DL + RF + SVM67696639
RF + GBT + SVM67656639
The results shown in Table 10 are obtained by integrating the output of each algorithm with the output of other algorithms of each instance by voting mechanism. The accuracy obtained by ensemble algorithms is better than individual algorithm accuracy, only SVM accuracy is higher than the ensemble algorithms accuracy.

4.6. Performance Optimization of Random Forest

We carried out performance optimization of RF results towards the improvement of accuracy. To this end, various combinations of the number of trees and the depth of trees are tested. The number of trees is varied from 2 to 102 with a difference of 10 and the depth of trees from 1 to 100 with the same difference. It is worth mentioning that Rapidminer uses Gain Ratio as an attribute selection criterion for RF as a general approach, though it does not explicitly disclose the value used for the Gain Ratio. We handle this situation concretely for our case by testing the impact of explicit values of the Gain Ratio. Additionally, we also tested two other attribute selection criteria, Information Gain and Gini Index. Results obtained by optimized RF with Gain Ratio achieved higher accuracy than Information Gain and Gini Index. Results obtained by tuning the number of trees and depth of trees with Gini Index, Information Gain, and Gain Ratio as attribute selection criteria are given in Tables 11–23.
Table 11

Accuracy (%) results obtained by Information Gain on FS1.

No. of treesDepth of trees
121416181100
268.2968.2968.2968.2968.2968.29
2268.2968.2968.2968.2968.2968.29
4268.2968.2968.2968.2968.2968.29
6268.2968.2968.2968.2968.2968.29
8268.2968.2968.2968.2968.2968.29
10268.2968.2968.2968.2968.2968.29
Table 12

Accuracy (%) results obtained by Information Gain on FS2.

No. of treesDepth of trees
121416181100
270.7370.7370.7370.7370.7370.73
2270.7370.7370.7370.7370.7370.73
4270.7370.7370.7370.7370.7370.73
6270.7370.7370.7370.7370.7370.73
8270.7370.7370.7370.7370.7370.73
10270.7370.7370.7370.7370.7370.73
Table 13

Accuracy (%) results obtained by Information Gain on FS3.

No. of treesDepth of trees
121416181100
273.1773.1773.1773.1773.1773.17
2273.1773.1773.1773.1773.1773.17
4273.1773.1773.1773.1773.1773.17
6273.1773.1773.1773.1773.1773.17
8273.1773.1773.1773.1773.1773.17
10273.1773.1773.1773.1773.1773.17
Table 14

Accuracy (%) results obtained by Information Gain on FS4.

No. of treesDepth of trees
121416181100
246.3446.3446.3446.3446.3446.34
2246.3446.3446.3446.3446.3446.34
4246.3446.3446.3446.3446.3446.34
6246.3446.3446.3446.3446.3446.34
8246.3446.3446.3446.3446.3446.34
10246.3446.3446.3446.3446.3446.34
Table 15

Accuracy (%) results obtained by Gini Index on FS1.

No. of treesDepth of trees
121416181100
268.2968.2968.2968.2968.2968.29
2268.2968.2968.2968.2968.2968.29
4268.2968.2968.2968.2968.2968.29
6268.2968.2968.2968.2968.2968.29
8268.2968.2968.2968.2968.2968.29
10268.2968.2968.2968.2968.2968.29
Table 16

Accuracy (%) results obtained by Gini Index on FS2.

No. of treesDepth of trees
121416181100
278.0578.0578.0578.0578.0578.05
2278.0578.0578.0578.0578.0578.05
4278.0578.0578.0578.0578.0578.05
6278.0578.0578.0578.0578.0578.05
8278.0578.0578.0578.0578.0578.05
10278.0578.0578.0578.0578.0578.05
Table 17

Accuracy (%) results obtained by Gini Index on FS3.

No. of treesDepth of trees
121416181100
273.1773.1773.1773.1773.1773.17
2273.1773.1773.1773.1773.1773.17
4273.1773.1773.1773.1773.1773.17
6273.1773.1773.1773.1773.1773.17
8273.1773.1773.1773.1773.1773.17
10273.1773.1773.1773.1773.1773.17
Table 18

Accuracy (%) results obtained by Gini Index on FS4.

No. of treesDepth of trees
121416181100
246.3446.3446.3446.3446.3446.34
2246.3446.3446.3446.3446.3446.34
4246.3446.3446.3446.3446.3446.34
6246.3446.3446.3446.3446.3446.34
8246.3446.3446.3446.3446.3446.34
10246.3446.3446.3446.3446.3446.34
Table 19

Accuracy (%) results obtained by Gain Ratio on FS1.

No. of treesDepth of trees
11121314151
224.463.456.156.156.156.1
1224.475.668.368.368.368.3
2226.875.673.273.273.273.2
3226.875.673.270.770.770.7
4226.875.673.270.770.770.7
5226.875.673.270.770.770.7
Table 20

Accuracy (%) results obtained by Gain Ratio on FS2.

No. of treesDepth of trees
11121314151
224.468.368.368.368.368.3
1224.475.668.368.368.368.3
2226.87870.770.770.770.7
3226.87870.768.368.368.3
4226.873.273.273.273.273.2
5226.873.270.773.273.273.2
Table 21

Accuracy (%) results obtained by Gain Ratio on FS3.

No. of treesDepth of trees
11121314151
224.458.565.965.965.965.9
1224.468.365.968.368.368.3
2226.870.765.965.965.965.9
3226.868.368.368.368.368.3
4226.870.7370.7370.7370.7370.73
5226.870.7370.7370.7370.7370.73
Table 22

Accuracy (%) results obtained by Gain Ratio on FS4.

No. of treesDepth of trees
11121314151
224.441.546.348.848.848.8
1224.451.241.548.848.848.8
2226.851.243.948.848.848.8
3226.853.736.641.541.541.5
4226.853.736.641.541.541.5
5226.853.736.641.541.541.5
Table 23

Best results obtained by Gain ratio, Information gain and Gini Index (%) using different feature sets.

Feature setsGain ratio (%)Information gain (%)Gini index (%)
175.6068.2968.29
27870.7378.05
375.6173.1773.17
453.7046.3446.34

4.6.1. Information Gain

Table 11 shows the optimized results of RF by varying the number of trees and depth of trees with selecting the Information Gain as an attribute selection criterion. There is no change in results by tuning the number of trees and depth of trees. Table 12 shows no change in results with Information Gain also on FS2 by tuning the parameters of RF. Results depicted in Tables 13 and 14 show that using Information Gain as attribute selection criterion with tuning the parameters of RF does not have any impact on accuracy. Results show consistent value by varying the number of trees and their depth. But accuracy for different feature sets is different.

4.6.2. Gini Index

Tables 15–18 shows the results of RF by varying its parameters with Gini Index as attribute selection criterion. Results obtained by Gini Index are similar to results obtained by Information Gain by giving the single constant value with no difference in classification accuracy.

4.6.3. Gain Ratio

Tables 19–22 show the accuracy achieved by varying the number of trees and its depth with Gain Ratio. Results obtained with the Gain Ratio are better than Information Gain and Gini Index. Gain Ratio gives the highest accuracy of 75.6% with 12 trees and 11 depth on FS1. In Figure 10 it is shown that the maximum accuracy was achieved with 12 trees and 11 depth. After increasing the number of trees and depth from 12 to 11 respectively, accuracy is decreased instead of increasing. But this accuracy is greater by 13% than auto model results of RF on FS1.
Figure 10

Impact of different number of trees and depth on accuracy of FS1 choosing Gain Ratio as attribute selection criteria.

Maximum accuracy obtained on FS2 is 78% with Gain Ratio. These results are greater than the results of the auto model of RapidMiner tool which was 69% on FS2. Results are given in Table 20. Figure 11 shows the result of optimized RF on FS2. The highest accuracy is achieved with 22 trees and 11 depth. Accuracy is decreasing beyond 32 trees and 11 depth.
Figure 11

Impact of different number of trees and depth on accuracy of FS2 choosing Gain Ratio as attribute selection criteria.

Results on FS3 are shown in Table 21. The maximum accuracy achieved on FS3 is 70.73%. Results of optimized RF are 2% greater than the auto model of RF on FS3. The result of optimized RF on FS3 is shown in Figure 12.
Figure 12

Impact of different number of trees and depth on accuracy of FS3 choosing Gain Ratio as attribute selection criteria.

Table 22 shows the results on FS4. Accuracy on FS4 is 51.7% which is less than other feature sets but greater than auto model accuracy. The result of optimized RF on FS4 is shown in Figure 13. Comparison using Information Gain, Gini Index, and Gain Ratio as an attribute selection criterion is depicted in Table 23 and it is shown that Gain Ratio obtained higher accuracy than others.
Figure 13

Impact of different no. of trees and depth on accuracy of FS4 choosing Gain Ratio as attribute selection criteria.

4.7. Performance of Deep Learning Models

Besides using machine learning models and optimizing the performance of RF, this study conducted experiments using deep learning models. Table 24 shows the results of deep learning models, which indicate that the performance of deep learning models is poor as compared to results obtained with FS2 using RF classifier. Apparently, the small size of the data is not large enough for the deep learning models to learn the complex relationships. Similarly, for proper training, RestNet could not find a large feature set and its performance is lower than expected. Deep learning models are data-intensive and do not perform well when the data size is small [58].
Table 24

Performance evaluation of deep learning models.

ModelPrecision (%)Recall (%)F1 (%)Accuracy (%)
CNN0.720.720.700.72
LSTM0.770.760.760.77
RestNet0.740.720.730.74

4.8. Results Using k-Fold Cross-Validation

To confirm the performance of the used machine learning models and RF model especially, k-fold cross-validation is performed as well. Table 25 shows the results of 10-fold cross-validation for all the models used in this study. Results show that RF provides the highest accuracy of 0.78 followed by LSTM for the rose plant classification task. RF performance is compared with FS2 where its accuracy is the highest of all models.
Table 25

Results of 10-fold cross-validation.

FoldNBGLMDTRFGBTSVM
1st-fold0.5120.6180.4920.7910.6620.712
2nd-fold0.5220.6710.4870.7800.6380.722
3rd-fold0.4990.6900.4900.7930.6940.749
4th-fold0.5060.6370.4990.7870.6620.726
5th-fold0.5210.6290.4980.7730.6240.698
6th-fold0.5100.6260.4910.7900.6880.688
7th-fold0.4870.6170.4870.7890.6800.713
8th-fold0.5190.6940.4900.7910.6110.708
9th-fold0.4960.6340.4890.7690.6370.721
10th-fold0.5090.6450.4880.7900.6260.717
Average0.5080.6460.4910.7850.6520.715

4.9. Results of T-Test

For corroborating the performance of RF over other models, the statistical T-test is performed with the following hypotheses: Null Hypothesis (H0): The performance of RF is not statistically significant over other approaches. Alternative Hypothesis (Ha): The performance of RF is statistically significant over other approaches. Accepting the H0 indicates that the performance of RF and other models is similar with no substantial difference. On the other hand, rejecting the H0 and accepting the Ha confirms that the performance of other models and RF has substantial differences and RF shows superior performance to other models. For this study, the T-test rejects the H0 and accepts Ha as the value of t is 4.769 with a critical value equal to 0.491.

5. Conclusion

The current study aims to classify the rose plant leaves using NB, GLM, DL, DT, RF, GBT, and SVM; in essence, the performance of RF is extensively investigated. Several sets of experiments have been performed with histogram features and texture features. For analysing the efficacy of various combinations of these features, four feature sets are made concerning their correlation to the target class. Results indicate that SVM obtains the highest accuracy of 72% on FS1. The FS1 contains both texture and histogram features having a high correlation with the target class both features contribute to classification. Other classifiers show better performance when trained on FS2 and FS3. But the performance of all classifiers is poor on FS4 because it contains only texture features which indicate that using texture features alone is not useful for predicting true labels. A special emphasis is placed on ensemble models and various combinations of selected classifiers are used in this regard. The accuracy achieved by ensemble learning is higher than individual algorithms except for SVM. Owing to its better performance, RF is investigated in detail by varying the number of trees, its depth, and attribute selection criterion including Information Gain, Gini Index, and Gain ratio. This parameter tuning contributes to achieving better accuracy than using a model as the black box. The optimization of RF proves to show better results than the auto model. RF achieves the highest accuracy scores of 73.17% with Information Gain, 78.05% with Gini Index, and 78.0% with the Gain ratio for FS3, FS2, and FS2, respectively. The bias of Information Gain towards choosing the attributes with more information values leads to poor performance than that of Gain ratio and Gini Index. In addition, CNN, LSTM, and RestNet models are used for experiments, however, their performance is not better than the fine-tuned RF. Owing to the small size of the dataset, the performance of the deep learning models is not investigated extensively. We intend to enlarge the dataset size by collecting further images and utilizing the resampling techniques as well. We also plan to compare the performance analysis of other classification algorithms such as neural networks, SVM, and DT for rose classification problems. Also, it would be interesting to select features methodically.
  10 in total

1.  A Very Simple Safe-Bayesian Random Forest.

Authors:  Novi Quadrianto; Zoubin Ghahramani
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2015-06       Impact factor: 6.226

2.  Exploratory undersampling for class-imbalance learning.

Authors:  Xu-Ying Liu; Jianxin Wu; Zhi-Hua Zhou
Journal:  IEEE Trans Syst Man Cybern B Cybern       Date:  2008-12-16

3.  Deep Learning for Plant Identification in Natural Environment.

Authors:  Yu Sun; Yuan Liu; Guan Wang; Haiyan Zhang
Journal:  Comput Intell Neurosci       Date:  2017-05-22

Review 4.  Automated plant species identification-Trends and future directions.

Authors:  Jana Wäldchen; Michael Rzanny; Marco Seeland; Patrick Mäder
Journal:  PLoS Comput Biol       Date:  2018-04-05       Impact factor: 4.475

5.  Flowers, leaves or both? How to obtain suitable images for automated plant identification.

Authors:  Michael Rzanny; Patrick Mäder; Alice Deggelmann; Minqian Chen; Jana Wäldchen
Journal:  Plant Methods       Date:  2019-07-23       Impact factor: 4.993

6.  Detecting cassava mosaic disease using a deep residual convolutional neural network with distinct block processing.

Authors:  David Opeoluwa Oyewola; Emmanuel Gbenga Dada; Sanjay Misra; Robertas Damaševičius
Journal:  PeerJ Comput Sci       Date:  2021-03-02

7.  AI-Driven Framework for Recognition of Guava Plant Diseases through Machine Learning from DSLR Camera Sensor Based High Resolution Imagery.

Authors:  Ahmad Almadhor; Hafiz Tayyab Rauf; Muhammad Ikram Ullah Lali; Robertas Damaševičius; Bader Alouffi; Abdullah Alharbi
Journal:  Sensors (Basel)       Date:  2021-06-01       Impact factor: 3.576

8.  Plant Species Identification Using Computer Vision Techniques: A Systematic Literature Review.

Authors:  Jana Wäldchen; Patrick Mäder
Journal:  Arch Comput Methods Eng       Date:  2017-01-07       Impact factor: 7.302

9.  Acquiring and preprocessing leaf images for automated plant identification: understanding the tradeoff between effort and information gain.

Authors:  Michael Rzanny; Marco Seeland; Jana Wäldchen; Patrick Mäder
Journal:  Plant Methods       Date:  2017-11-08       Impact factor: 4.993

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.