| Literature DB >> 24453933 |
C Fernandez-Lozano1, C Canto1, M Gestal1, J M Andrade-Garda2, J R Rabuñal1, J Dorado1, A Pazos1.
Abstract
Given the background of the use of Neural Networks in problems of apple juice classification, this paper aim at implementing a newly developed method in the field of machine learning: the Support Vector Machines (SVM). Therefore, a hybrid model that combines genetic algorithms and support vector machines is suggested in such a way that, when using SVM as a fitness function of the Genetic Algorithm (GA), the most representative variables for a specific classification problem can be selected.Entities:
Mesh:
Year: 2013 PMID: 24453933 PMCID: PMC3874306 DOI: 10.1155/2013/982438
Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X
Figure 1General outline of the operation of an evolutionary algorithm.
Figure 2Linearly separable classification.
Figure 3Linearly separable classification; hyperplane and separation margin.
Figure 4Slack variables.
Algorithm 1
Figure 5Evaluation of the genetic individuals.
Samples with high concentration.
| Range: 20%–100% | |||
|---|---|---|---|
| % | Training | Test | Commercial |
| 20% | 20 | 6 | |
| 25% | 19 | 18 | 2 |
| 50% | 16 | 13 | |
| 70% | 14 | 1 | |
| 100% | 17 | 6 | 19 |
|
| |||
| Total |
|
|
|
Samples with low concentration.
| Range: 2%–20% | |||
|---|---|---|---|
| % | Training | Test | Commercial |
| 2% | 19 | 1 | |
| 4% | 17 | 1 | |
| 6% | 16 | 13 | |
| 8% | 22 | 6 | |
| 10% | 21 | 6 | 1 |
| 16% | 20 | 6 | 1 |
| 20% | 19 | 6 | |
|
| |||
| Total |
|
|
|
Figure 6IR spectrum, specific to a sample.
Setting of SVM parameters: samples with high concentration.
| SVM parameters | MSE train | |||
|---|---|---|---|---|
|
| Epsilon | Radial | ||
| 1° | 10000 | 0,000001 | 5 | 3,83 |
| 2° | 10000 | 0,0001 | 2 | 4,49 |
| 3° | 10000 | 0,0001 | 2 | 4,49 |
Setting of SVM parameters: samples with low concentration.
| SVM parameters | MSE train | |||
|---|---|---|---|---|
|
| Epsilon | Radial | ||
| 1° | 1000 | 0,000001 | 2 | 5,26 |
| 2° | 1000 | 0,000001 | 10 | 5,95 |
| 3° | 1000 | 0,0001 | 2 | 5,95 |
Genetic algorithm configuration.
| (i) GA: simple | (i) Selection: wheel-roulette |
| (ii) Number of individuals: 100 | (ii) Number of generations: 100 |
| (iii) Crossover rate: 90% | (iii) Crossover: uniform |
| (iv) Mutation rate: 10% | (iv) Mutation: uniform |
High concentration: classification errors using 176 variables.
| Training | Validation | Commercial | |
|---|---|---|---|
| PLS | 11 | 5 | 1 |
| SIMCA | 15 | 12 | 1 |
| Potential functions | 4 | 6 | 0 |
| ANN | 0 | 1 | 0 |
| SVM | 0 | 39 | 21 |
Low concentration: classification errors using 176 variables.
| Training | Validation | Commercial | |
|---|---|---|---|
| PLS | 29 | 11 | 0 |
| SIMCA | 19 | 14 | 0 |
| Potential functions | 4 | 9 | 0 |
| ANN | 0 | 4 | 0 |
| SVM | 0 | 33 | 1 |
Classification errors using variables selected by Procrustes rotation. High concentration.
| Training | Validation | Commercial | |
|---|---|---|---|
| PLS | 6 | 6 | 1 |
| SIMCA | 44 | 27 | 8 |
| Potential functions | 5 | 6 | 0 |
| ANN | 6 | 8 | 0 |
| SVM | 5 | 3 | 3 |
Classification errors using variables selected by Procrustes rotation. Low concentration.
| Training | Validation | Commercial | |
|---|---|---|---|
| PLS | 29 | 9 | 0 |
| SIMCA | 36 | 17 | 0 |
| Potential functions | 29 | 13 | 0 |
| ANN | 28 | 17 | 2 |
| SVM | 13 | 4 | 1 |
Classification Errors using GA + ANN: high concentration.
| High concentrations | |||
|---|---|---|---|
| Selected | Train | Test | Commercial |
| [52, 141] | 4 | 8 | 1 |
| [102, 129] | 10 | 14 | 2 |
| [23, 67] | 5 | 16 | 1 |
| [18, 120] | 3 | 12 | 2 |
Classification errors using GA + ANN: low concentration.
| Low concentrations | |||
|---|---|---|---|
| Selected | Train | Test | Commercial |
| [52, 141] | 19 | 17 | 2 |
| [102, 129] | 10 | 14 | 2 |
| [23, 67] | 5 | 16 | 1 |
| [18, 120] | 3 | 12 | 2 |
Classification errors using GA + SVM: high concentration.
| Selected | Train | Test | Commercial |
|---|---|---|---|
| MSE train | |||
| [4, 69] | 5 | 5 | 2 |
| [81, 84] | 9 | 5 | 2 |
| [68, 28] | 4 | 6 | 2 |
| [91, 101] | 4 | 6 | 1 |
|
| |||
| MSE train + MSE test | |||
| [9, 105] | 5 | 3 | 3 |
| [9, 113] | 5 | 4 | 9 |
| [34, 101] | 4 | 5 | 11 |
| [86, 104] | 5 | 5 | 4 |
Classification errors using GA + SVM: low concentration.
| Selected | Train | Test | Commercial |
|---|---|---|---|
| MSE train | |||
| [1, 107] | 15 | 5 | 1 |
| [5, 91] | 16 | 5 | 1 |
| [2, 107] | 17 | 5 | 1 |
| [11, 97] | 14 | 6 | 1 |
|
| |||
| MSE train + MSE test | |||
| [5, 73] | 13 | 4 | 1 |
| [27, 90] | 14 | 4 | 1 |
| [2, 94] | 15 | 4 | 1 |
| [27, 88] | 14 | 5 | 2 |
Figure 7Classification errors according to the used fitness function.