| Literature DB >> 29236744 |
Wilson Castro1, Jimy Oblitas2,3, Roberto Santa-Cruz4, Himer Avila-George5.
Abstract
The objective of this research was to develop a methodology for optimizing multilayer-perceptron-type neural networks by evaluating the effects of three neural architecture parameters, namely, number of hidden layers (HL), neurons per hidden layer (NHL), and activation function type (AF), on the sum of squares error (SSE). The data for the study were obtained from quality parameters (physicochemical and microbiological) of milk samples. Architectures or combinations were organized in groups (G1, G2, and G3) generated upon interspersing one, two, and three layers. Within each group, the networks had three neurons in the input layer, six neurons in the output layer, three to twenty-seven NHL, and three AF (tan-sig, log-sig, and linear) types. The number of architectures was determined using three factorial-type experimental designs, which reached 63, 2 187, and 50 049 combinations for G1, G2 and G3, respectively. Using MATLAB 2015a, a logical sequence was designed and implemented for constructing, training, and evaluating multilayer-perceptron-type neural networks using parallel computing techniques. The results show that HL and NHL have a statistically relevant effect on SSE, and from two hidden layers, AF also has a significant effect; thus, both AF and NHL can be evaluated to determine the optimal combination per group. Moreover, in the three study groups, it is observed that there is an inverse relationship between the number of processors and the total optimization time.Entities:
Mesh:
Year: 2017 PMID: 29236744 PMCID: PMC5728525 DOI: 10.1371/journal.pone.0189369
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Cajamarca region map.
Dataset geographic coordinates.
| Point | Latitude | Longitude |
|---|---|---|
| R-12 | −6°56′28.451″ | −78°46′35.976″ |
| R-13 | −6°57′37.054″ | −78°42′45.183″ |
| R-14 | −6°48′59.133″ | −78°31′4.870″ |
| R-15 | −6°56′30.461″ | −78°39′14.912″ |
| R-23 | −7°0′12.497″ | −78°18′31.890″ |
| R-24 | −6°56′46.243″ | −78°12′51.233″ |
| R-25 | −7°8′9.444″ | −78°24′33.531″ |
| R-61 | −6°53′40.768″ | −78°34′12.428″ |
| R-63 | −7°10′51.045″ | −78°38′44.912″ |
| R-64 | −6°58′46.375″ | −78°33′31.697″ |
| R-72 | −7°5′19.486″ | −78°29′33.630″ |
| R-81 | −7°16′45.528″ | −78°24′33.087″ |
Analyses performed on the milk samples.
| Parameter | Method | Source | |
|---|---|---|---|
| Input | Density | Lactodensimeter (AOAC 925.22) | [ |
| Oxidation-Reduction Potential | Reaction time to methylene blue | [ | |
| Potential of Hydrogen (pH) | Potentiometer | [ | |
| Output | Proteins | Infrared spectroscopy (NTP 202.130:1998) | [ |
| Lactose | |||
| Total solids | |||
| Solids-fat | |||
| Solids-non-fat | |||
| Minerals |
Fig 2Condensed architecture for multilayer perceptron.
Ranges in structural parameters.
| Parameters | Range |
|---|---|
| Input neuron layer (IN) | 3 |
| Output neuron layer (ON) | 6 |
| Number of hidden layers (HL) | [1-3] |
| Neurons per hidden layer (NHL) | [3-27] |
| Activation functions | [1-3] |
⋆ (1) Hyperbolic tangent sigmoid (tan-sig)
(2) Log sigmoid (log-sig)
(3) Linear
Fig 3Groups of neural architectures proposed for the study.
Experimental designs used in this research.
| Group | Designs | Factors | |
|---|---|---|---|
| Name | Levels | ||
| 1 | 1HL2AF | NHL1 | [3 6 9 12 15 18 21 24 27] |
| AF1 | [1 2 3] | ||
| AF2 | [1 2 3] | ||
| 2 | 2HL3AF | NHL1 | [3 6 9 12 15 18 21 24 27] |
| NHL2 | [3 6 9 12 15 18 21 24 27] | ||
| AF1 | [1 2 3] | ||
| AF2 | [1 2 3] | ||
| AF3 | [1 2 3] | ||
| 3 | 3HL4AF | NHL1 | [3 6 9 12 15 18 21 24 27] |
| NHL2 | [3 6 9 12 15 18 21 24 27] | ||
| NHL3 | [3 6 9 12 15 18 21 24 27] | ||
| AF1 | [1 2 3] | ||
| AF2 | [1 2 3] | ||
| AF3 | [1 2 3] | ||
| AF4 | [1 2 3] | ||
Treatments per experimental design.
| Design | Distribution of elements | Number of Treatments |
|---|---|---|
| 1HL2AF | IN, NHL1 | 63 |
| 2HL3AF | IN, NHL1 | 2 187 |
| 3HL4AF | IN, NHL1 | 59 049 |
⋆Sub-indexes correspond to the levels assumed in each combination.
Fig 4Sequence for constructing, training, and evaluating networks.
Training and validation data per neural network.
| Variables | Units | Values | ||||
|---|---|---|---|---|---|---|
| min | max |
| ||||
| Input | Density (Dn) | g/ml | 1.026 | 1.03 | 1.028 | 0.001 |
| Oxidation-Reduction Potential (Rd) | hours | 6.5 | 6.79 | 6.63 | 0.049 | |
| Potential of Hydrogen | — | 6 | 8 | 6.5 | 0.637 | |
| Output | Proteins (Pr) | g/100 ml | 2.69 | 3.33 | 3.005 | 0.14 |
| Lactose (Lc) | g/100 ml | 4.31 | 5.24 | 4.85 | 0.187 | |
| Solids total (St) | g/100 ml | 10.89 | 13.14 | 12.22 | 0.433 | |
| Solids-fat (Sf) | g/100 ml | 3 | 4.1 | 3.62 | 0.183 | |
| Solids-non-fat (Snf) | g/100 ml | 7.73 | 9.27 | 8.54 | 0.31 | |
| Minerals (Mn) | g/100 ml | 0.41 | 0.71 | 0.7 | 0.023 | |
Fig 5First ten combinations for G1, G2, and G3 and interpretation.
Fig 6SSE for each group.
Variance analysis for SSE G1.
| Source | Sum of squares | Degrees of freedom | Mean square | Ratio-F | Value-P |
|---|---|---|---|---|---|
| Main Effects | |||||
| NHL1 | 0.1243 | 6 | 0.0207 | 4.86 | 0.0005 |
| AF1 | 0.0145 | 2 | 0.0073 | 1.7 | 0.1927 |
| AF2 | 0.0247 | 2 | 0.0124 | 2.9 | 0.0640 |
| Residual | 0.2217 | 52 | 0.0043 | ||
| Total (corrected) | 0.3853 | 62 |
⋆ Reliability level 99%.
Variance analysis for SSE G3.
| Source | Sum of squares | Degrees of freedom | Mean square | Ratio-F | Value-P |
|---|---|---|---|---|---|
| Main Effects | |||||
| NHL1 | 7.6283 | 8 | 0.9535 | 147.58 | 0 |
| NHL2 | 2.7694 | 8 | 0.3462 | 53.58 | 0 |
| NHL3 | 2.464 | 8 | 0.308 | 47.67 | 0 |
| AF1 | 38.776 | 2 | 19.3881 | 3000.69 | 0 |
| AF2 | 0.4399 | 2 | 0.2199 | 34.04 | 0 |
| AF3 | 4.6349 | 2 | 2.3174 | 358.67 | 0 |
| AF4 | 3.5627 | 2 | 1.7814 | 275.7 | 0 |
| Residual | 381.3121 | 59016 | 0.0065 | ||
| Total (corrected) | 441.587 | 59048 |
⋆ Reliability level 99%.
Fig 7Interaction of structural parameters per group.
Optimal values of NHL and AF per group.
| Factor | Optimal values | ||
|---|---|---|---|
| G1 | G2 | G3 | |
| NHL1 | 22 | 25 | 18 |
| NHL2 | — | 27 | 27 |
| NHL3 | — | — | 26 |
| AF1 | 2 | 3 | 3 |
| AF2 | 1 | 1 | 3 |
| AF3 | — | 2 | 1 |
| AF4 | — | — | 1 |
| SSE | 1.0217 | 0.9876 | 0.9847 |
Fig 8Calculation times per group and number of processors.
Fig 9Acceleration of optimization per groups.
Fig 10Efficiency in the optimization of the groups.
Variance analysis for SSE G2.
| Source | Sum of squares | Degrees of freedom | Mean square | Ratio-F | Value-P |
|---|---|---|---|---|---|
| Main Effects | |||||
| NHL1 | 0.6434 | 8 | 0.0804 | 13.41 | 0 |
| NHL2 | 0.2346 | 8 | 0.0293 | 4.89 | 0 |
| AF1 | 0.9168 | 2 | 0.4584 | 76.45 | 0 |
| AF2 | 0.1084 | 2 | 0.0542 | 9.04 | 0.0001 |
| AF3 | 0.2903 | 2 | 0.1451 | 24.2 | 0 |
| Residual | 12.9758 | 2164 | 0.006 | ||
| Total (corrected) | 15.1693 | 2186 |
⋆ Reliability level 99%.