Literature DB >> 34663842

A modeling method for the development of a bioprocess to optimally produce umqombothi (a South African traditional beer).

Edwin Hlangwani1, Wesley Doorsamy2, Janet Adeyinka Adebiyi1, Lanrewaju Ibrahim Fajimi3, Oluwafemi Ayodeji Adebo4.   

Abstract

Bioprocess development for umqombothi (a South African traditional beer) as with other traditional beer products can be complex. As a result, beverage bioprocess development is shifting towards new systematic protocols of experimentation. Traditional optimization methods such as response surface methodology (RSM) require further comparison with a relevant machine learning system. Artificial neural network (ANN) is an effective non-linear multivariate tool in bioprocessing, with enormous generalization, prediction, and validation capabilities. ANN bioprocess development and optimization of umqombothi were done using RSM and ANN. The optimum condition values were 1.1 h, 29.3 °C, and 25.9 h for cooking time, fermentation temperature, and fermentation time, respectively. RSM was an effective tool for the optimization of umqombothi's bioprocessing parameters shown by the coefficient of determination (R2) closer to 1. RSM significant parameters: alcohol content, total soluble solids (TSS), and pH had R2 values of 0.94, 0.93, and 0.99 respectively while the constructed ANN significant parameters: alcohol content, TSS, and viscosity had R2 values of 0.96, 0.96, and 0.92 respectively. The correlation between experimental and predicted values suggested that both RSM and ANN were suitable bioprocess development and optimization tools.
© 2021. The Author(s).

Entities:  

Year:  2021        PMID: 34663842      PMCID: PMC8523540          DOI: 10.1038/s41598-021-00097-w

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

The heterogeneous nature of food complicates food bioprocessing operations through varying responses to process conditions[1]. Thus, the development and application of dynamic optimization approaches is an important step towards ensuring robust process control, quality, and consumer safety[2,3]. The technical application of these approaches, especially in a biologically complex product such as traditional beer has been minimal[4]. Furthermore, variable microbial growth kinetics, process constraints, biochemical reactions, dynamic food matrices, and difficult bioprocessing requirements amplify complexities in bioprocess development and optimization[3,5]. As a result, the combination of linear and non-linear techniques is an effective approach to describe, analyze, and predict bioprocess responses that impact the outcomes of the final product[3,6]. The use of a single technique may not be adequate in ascertaining the relationship between process input variables and the quality of the product[5]. Nonetheless, standalone mathematical and statistical models have been previously successful in describing the linear, interactive, and quadratic effects of selected parameters in beer bioprocessing[6,7]. Response surface methodology (RSM) and factorial experiment with their associated designs are traditional statistical models which have been applied extensively to screen and optimize factors in the biotechnology and food engineering industries[3,6]. RSM consists of a group of empirical techniques which evaluate the relationship between a group of control experiment parameters to achieve an optimal process[8,9]. In particular, RSM has been used as a statistical method to generate efficient models to optimize very large and complex bioprocesses in food systems[1,10]. RSM determines the significance of a model and defines the relationship between process variables through analysis of variance (ANOVA) and the lack of fit[1,11]. Moreover, the optimum conditions are determined by the desirability function[10,11]. However, traditional techniques show significant limitations in biological processes[3,6]. For example, RSM disregards parameters deemed insignificant without accounting for the possible interactive effects on the output of bioprocess. Artificial intelligence (AI) and machine learning (ML) tools such as fuzzy logic, ANN, particle swarm optimization (PSO), and genetic algorithm (GA) are emerging technologies appropriate for the research and development of efficient bioprocesses[6,12]. Recently, the application of 2-step or 3-step optimization approaches involving RSM, ANN, and GA has become standard practice for manufacturing and other biological processes[7,12]. However, these tools and approaches have been rarely applied in the modeling and optimization of brewing and fermentation processes[3]. ANN has been successful in accurately approximating linear and non-linear functions from historic data devoid of cellular kinetics and metabolic fluxes, especially in multivariate bioprocesses[3,6]. ANNs are mathematical emulations of the biological learning process occurring within the brain. It can arithmetically model the network structure of interconnected nerve cells, and thus “learns”, link associations, and adapts to make accurate value predictions from a specific sample set[13]. ANN possesses extraordinary processing abilities such as self-organization and data classification, pattern recognition, processing fuzzy and inaccurate information, good generalization capabilities, quicker processing time, noise and fault tolerance as well as high parallelism[12,13]. Given its numerous benefits, the use of ANN as a non-linear multivariate tool in bioprocess development can improve both the bioprocess and the final product[14,15]. For umqombothi bioprocessing, ANN presents a unique advantage in the improving developed RSM models since its standard framework has an inherent ability to use background information to solve problems[4,16]. Bioprocessing approaches that apply both RSM with adaptive learning techniques such as ANN have been shown to have better accuracy, prediction, and dependence relation when compared to traditional, isolated RSM[17,18]. As such, bioprocess development and optimization without carefully deliberated process designs will result in irreproducible and unreliable process designs[4,10]. In this study, a modeling method for the development of a bioprocess to optimally produce umqombothi was investigated.

Methodology

Traditional beer () brewing process

Five hundred (500) g of pre-packaged King Korn malted sorghum (Mtombo – Mmela) (Tiger Brands, Bryanston, South Africa) was mixed with 1000 g of White Star maize meal (Pioneer Foods, Bryanston, South Africa) in a sterile 10 L bucket filled with 7 L tap water. The mixture was gently stirred, covered, and incubated (Labcon, Chamdor, South Africa) at 25 °C for 24 h to sour. Thereafter, the soured paste was stirred gently and cooked for 30 min at 45 °C to make a traditional beer porridge (isdudu). The porridge was allowed to cool to 25 °C after which 500 g of King Korn malted sorghum (Tiger Brands, Bryanston, South Africa) was added and gently stirred. The mixture was then incubated at 30 °C (Labcon, Chamdor, South Africa) for 24 h to ferment. The finished beer was then tested for physicochemical properties.

Experimental design using response surface methodology (RSM)

Preliminary experiments (data not presented herein) were conducted to determine appropriate ranges for processing factors: cooking time, fermentation temperature, and fermentation time and their effects on alcohol content, total soluble solids (TSS), total titratable acidity (TTA), pH, and viscosity in umqombothi. The obtained data was then used for the design of experiments (DOE) (Fig. 1). Thereafter, appropriate ranges were determined for factors of interest (Table 1). Central Composite Design (CCD) in Design-Expert software version 11.0.0 (Stat-Ease Inc., Minneapolis, USA) was used to generate 20 experimental runs. The input factors were cooking time (hr), fermentation temperature (°C), and fermentation time (hr) (Table 1). Following experimental combinations (Table 2) subsequent experiments were conducted.
Figure 1

A flow chart of the complete experimental design and optimization techniques.

Table 1

Process parameters selected for optimization: cooking time, fermentation temperature, and fermentation time.

ParametersCodeHigh level (+ 1)Medium level (0)Low level (–1)
Cooking time (hr)X1321
Fermentation temperature (°C)X2353025
Fermentation time (hr)X3966024

hr hour.

Table 2

Experimental design of umqombothi.

Experimental runCooking time (hr)Fermentation temperature (°C)Fermentation time (hr)
1238.4160
223060
333524
423060
523060
632524
723060
813596
932596
1012524
1123060
1212596
1333596
143.683060
15221.5960
1613524
170.323060
18230120.54
1923060
202300

hr hour.

A flow chart of the complete experimental design and optimization techniques. Process parameters selected for optimization: cooking time, fermentation temperature, and fermentation time. hr hour. Experimental design of umqombothi. hr hour. Samples were withdrawn after each experimental run (done in triplicates) and alcohol content (°P), TSS (g/100 g), TTA (% lactic acid), pH, viscosity (cm/min) were determined. The Design-Expert software was also used to analyze and compute a second-order polynomial model to estimate and predict response values over a range of input parameter values by determining which input factors influenced responses, and the direction of that drive for the designed experiments as depicted in Eq. (1) below:where indicated the response variable (optimal production parameter), the intercept of the response variable, while , , and were coefficients corresponding to the factor , ( = 1, 2, …, n). The input variables that affected the response were ,, . The random error was represented by .

Neural network construction and fitting

Experimental data was organized and used for the development of ANN prediction models. A matrix laboratory MATLAB R2020a (MathWorks, Massachusetts, USA) software was used for the design of function fitting neural network. A feed-forward neural network with two layers was used. The first layer was the input layer and the second layer was the output layer, both of which were triggered using the sigmoid activation function. Cooking time (hr), fermentation temperature (°C) and fermentation time (hr) were used as network inputs and alcohol content (°P), TSS (g/100 g), TTA (% lactic acid), pH, and viscosity (cm/min), were each used as the outputs to develop several networks and to determine the optimal network topology. Experimental data were randomly divided for training, validation, and testing. For training, 14 (70%) instances were used, 3 (15%) for validation and 3 (15%) for testing. The ANN model was then trained, validated, and tested by the Levenberg–Marquardt (LM) training algorithm. To further study the responses of the model, Bayesian Regularization (BR) and Scaled Conjugate Gradient (SCG) training algorithms were also evaluated. The network was trained until the coefficient of correlation (R) was closer to 1.

Determination of physicochemical properties

Alcohol content

The alcohol content of the finished beer was determined using a digital refractometer for brewing (Hanna Instruments (Pty) Ltd., Johannesburg, South Africa). A clean pipette was used to place 0.5–1 ml of the finished beer on the sample well. The Plato readings were recorded afterward.

pH

The pH of the finished beverage was determined using a portable pH meter (Hanna Instruments (Pty) Ltd., Johannesburg, South Africa) after calibration with standard buffers of pH 4.00 and 7.00.

Total soluble solids

The total soluble solids of the finished beer were determined using a digital refractometer (Hanna Instruments (Pty) Ltd., Johannesburg, South Africa). A clean pipette was used to place 0.5–1 ml of the finished beer on the sample well. The refractive indices of the samples were then recorded accordingly.

Viscosity

The consistometer (Endecotts, London, United Kingdom) was used to determine the consistency of the finished beer (cm/min) by pouring 100 ml of the sample into the reservoir behind the gate of the consistometer. The lock release lever was released to instantaneously open the gate, allowing the liquid to flow over the instrument’s graduated scale for 1 min.

Total titratable acidity

The American Association of Cereal Chemists (AACC) 02-31[19] approved method was used to determine the total titratable acidity whereby 10 g of the sample was dissolved in 100 ml distilled water. The solution was well mixed and 0.5 ml of 1% phenolphthalein indicator was added. Finally, standardized 0.1 N sodium hydroxide was used to titrate the prepared solution until a faint pink color was observed. Titratable acidity (in terms of lactic acid %) = volume (ml) required / 20.

Statistical analysis

All experiments and analyses were conducted in triplicates. ANOVA was employed to determine the significance of the generated models. Design-Expert software version 11.0.0 (Stat-Ease Inc., Minneapolis, USA) was used to determine the Response (Y) of the second-order polynomial equation, the coefficient of determination (R2), the ‘predicted R-squared’ and ‘adjusted R-squared’, the coefficient of variance (CV), and the ‘probability F’ value.

Statement on experimental research and field studies on plants

We confirm that the use of plant-based cereals in our study complied with the relevant institutional, national, and international guidelines and legislation, in particular the IUCN Policy Statement on Research Involving Species at Risk of Extinction.

Results and discussion

The effect of cooking time, fermentation temperature, and fermentation time on the alcohol content, TSS, TTA, pH, and viscosity were investigated. Optimization of cooking time, fermentation temperature, and fermentation time is essential for maintaining consistent physicochemical properties, curbing undesired changes that may occur during bioprocessing, and understanding the interactions among these process variables at different conditions[1]. In beer production, these are principal factors that influence the final product and its acceptance by consumers[20,21].

The effect of input factors on the physicochemical properties of the beer

Samples fermented for a longer period (≥ 60 h) at a relatively higher temperature (≥ 30 °C) contained a lower alcohol content (Table 3, see experimental run numbers 1, 4, 7, 9, 15, and 20). Generally, a higher fermentation temperature affects the rate of sugar metabolism (i.e., leads to a rapid increase in alcohol content and other by-products such as volatile compounds)[21]. On a contrary, in this study, a higher temperature accompanied by a longer fermentation time led to a lower alcohol content (Table 3). Given these conditions, a low alcohol content may be attributed to evaporative ethanol loss. It’s not uncommon for product inhibition to occur during simultaneous saccharification and fermentation, whereby ethanol, a fermentation product, inhibits zymase over time while the products of saccharification inhibit hydrolytic enzymes[22]. In addition, the synthesis of acetate and acids such as formic acid, acetic acid, and levulinic acid at concentrations above 100 mM may inhibit the bioconversion of biomass[22,23] and thus influence alcohol content.
Table 3

Responses from the investigated input parameters.

Exp runInputsResponses
Cooking time (hr)Ferm temp (°C)Ferm time (hr)Alcohol (°P)TSS (g/100 g)TTA (% lactic acid)pHViscosity (cm/min)
1230608.07de ± 0.707.37cde ± 0.311.18hi ± 0.032.90ab ± 0.0515.33cde ± 0.58
2230608.70efg ± 0.307.73def ± 0.551.20ij ± 0.022.88ab ± 0.0314.50 cd ± 0.87
3325967.77d ± 0.757.37cde ± 0.601.07 fg ± 0.062.91ab ± 0.0110.83a ± 1.04
43.6830606.77c ± 0.066.63c ± 0.310.81e ± 0.082.90ab ± 0.0517.17 g ± 1.15
5125965.10a ± 0.354.90ab ± 0.440.81e ± 0.032.95ab ± 0.0819.17 h ± 0.29
6125249.50hi ± 0.209.27 h ± 0.311.54 m ± 0.043.62de ± 0.0222.67j ± 0.58
70.3230607.00c ± 0.266.70c ± 0.260.72bc ± 0.022.90ab ± 0.0525.00 k ± 1.00
8230010.47j ± 0.2110.3i ± 0.300.50a ± 0.034.60f. ± 0.2012.83b ± 0.29
9230120.544.70a ± 0.204.80a ± 0.260.72bc ± 0.033.26c ± 0.0216.50 fg ± 0.87
10221.59608.73efg ± 0.818.37 fg ± 0.650.78cde ± 0.052.99b ± 0.0114.17c ± 0.29
11230608.87fgh ± 0.327.90ef ± 0.461.21ij ± 0.052.92ab ± 0.0316.17efg ± 0.29
12230608.13de ± 0.157.87ef ± 0.121.13gh ± 0.042.91ab ± 0.0315.67def ± 0.58
13230608.43ef ± 0.297.87ef ± 0.381.22ij ± 0.022.90ab ± 0.0715.00 cd ± 0.00
14238.41609.50hi ± 0.309.60hi ± 0.850.79de ± 0.032.86ab ± 0.0517.33 g ± 0.58
15135967.07c ± 0.217.03 cd ± 0.150.68b ± 0.022.81a ± 0.0314.17c ± 0.29
16335249.27gh ± 0.159.03gh ± 0.381.35 k ± 0.043.36c ± 0.0314.33c ± 0.58
17135249.17gh ± 0.068.93gh ± 0.211.44 l ± 0.023.73e ± 0.2520.83i ± 0.76
18230608.37def ± 0.358.10ef ± 0.171.25j ± 0.012.88ab ± 0.0314.67 cd ± 0.29
193252410.07ij ± 0.129.50 h ± 0.781.05f. ± 0.053.51d ± 0.0216.83 g ± 0.76
20335965.73b ± 0.155.60b ± 0.100.74bcd ± 0.022.84ab ± 0.0412.33b ± 0.29

cm centimetre, Exp experimental, Ferm fermentation, g gram, hr hour, min minute, temp temperature.

*Each value is a mean of triplicates ± standard deviation of triplicates. Means with no common letters within a row significantly differ (p < 0.05).

Responses from the investigated input parameters. cm centimetre, Exp experimental, Ferm fermentation, g gram, hr hour, min minute, temp temperature. *Each value is a mean of triplicates ± standard deviation of triplicates. Means with no common letters within a row significantly differ (p < 0.05).

TSS

Cooking the soured porridge for an adequate amount of time is essential for starch gelatinization and release of locked-up nutrients in yeasts cells[24]. The cooking time was found to influence the alcohol content, TSS, pH, and viscosity (Table 3). The proliferation of fermentative microbes is driven by the hydrolysis of cooked starch to fermentable sugars by endogenous amylolytic enzymes[25]. As the endosperm protein enclosing the starch granules is softened (during gelatinization), moving the grain to the retting water, thereby increasing the amount of TSS[26]. This might explain the increasing trend in the amount of TSS with an increased cooking and fermentation time. As observed from Table 3, cooking for more than 1 h significantly increased the amount of TSS. A reverse trend was observed when the fermentation time was increased. This could be attributed to the growth patterns of microorganisms that correspond to the consumption of soluble solids over time[26]. The fermentation time largely contributed to the final product’s quality. The longer the fermentation was allowed to proceed, the lower the alcohol content, pH, and viscosity (Table 3). Fermentative microorganisms need sufficient time to adjust to environmental changes for optimal utilization of the substrate for building cellular components (RNA, enzymes, and metabolites)[27]. As cells complete the cell cycle, they enter the exponential growth phase, where they are the healthiest and most uniform, rapidly driving alcoholic fermentation forward[27]. A fermentation time of 24 h was observed to have a relatively higher alcohol content, TSS, and TTA levels compared to 60 h and 96 h. It is possible that within this timeframe fermentative microorganisms entered the exponential phase growth phase which led to a higher microbial activity rate.

pH and TTA

The TTA and pH ranged between 0.50–1.54% lactic acid and 2.81–4.60, respectively (Table 3). Generally, umqombothi and other African traditional beers have a pH range of 3 to 4.2, and a lactic acid level of 0.26% depending on how the beer is brewed[4,24]. Changes in TTA may be a better measure of the success rate of the fermentation process than changes in pH[26]. A biochemical relationship between alcohol content, TTA, and pH, whereby a lower pH was directly proportional to a high TTA and alcohol content, was observed in this study (Table 3). According to[25], as the microorganisms carry out alcoholic fermentation, a decrease in the TSS and pH are usually observed. Beers with decreased pH values, such as umqombothi (Table 3) have a longer shelf-life, better safety and quality, superior facilitation of microbial growth, and a higher concentration of antimicrobial agents[28]. The low pH and elevated acidity in these beers aid in the elimination of certain pathogenic microorganisms that could pose safety threats[29,30]. Cooking time had a direct influence on the final beer’s viscosity. This is because cooking increasing the availability of starch, which imparts viscidness to food and describes the clarity of the finished beer[31]. In addition, residual starch from incomplete hydrolysis into sugar contributes to a beer’s viscosity[25]. As TTA increases and the pH is lowered, the joint action of malt α-and-β amylases is reduced, thereby reducing the beer’s viscosity, and giving body to the final beer[25]. An increase in the α- amylase, Hitempase 2XL decreased the viscosity in beer produced from malted buckwheat[32]. In western beers, filtration of the beer may be difficult due to high viscosity, thus leading to starch hazes in the final product[32], while in traditional beers such as umqombothi, filtering the beer may result in the loss of important fiber-imparting solids, giving the beer a higher viscosity[4,33,34].

Multi-response optimization of process parameters

In search for the solution, ANOVA, and Fisher’s F-values were used to examine the best fit of the generated RSM models. Model adequacy was determined by the coefficient of determination values (R2) and lack of fit tests[1,20]. For the response in view, the R2 described the percentage contribution of the process variables (i.e., the amount of variation around the mean explained by the model). For high-confidence prediction purposes, a usable model demands percentage contribution of 88% (R2 > 0.88)[35]. The probability of significance was represented by p-values, with a high p-value indicating an inadequate model due to a significant lack of fit[36]. The models for alcohol content, TSS, and pH all had p-values of 0.00, indicating that the lack of fit was insignificant at a 100% confidence level. Polynomial equations together with 3D response surface plots were used to describe the mathematical solutions of the models. Polynomial equations for alcohol content, TSS, TTA, pH, and viscosity are shown in Eqs. (2), (3), (4), (5), and (6), respectively. For better visualization, 3D response surface plots for alcohol content, TSS, TTA, pH, and viscosity are shown in Fig. 2a,b. Regression equations from the fitted models were used to generate 3D plots.
Figure 2

(a) 3D response surface plots demonstrating the effect of cooking time, fermentation temperature, and time on umqombothi samples: (A) Alcohol content, (B) TSS, (C) TTA. (b) A 3D response surface plot demonstrating the effect of cooking time, fermentation temperature, and time on umqombothi samples: (A) pH, (B) Viscosity.

(a) 3D response surface plots demonstrating the effect of cooking time, fermentation temperature, and time on umqombothi samples: (A) Alcohol content, (B) TSS, (C) TTA. (b) A 3D response surface plot demonstrating the effect of cooking time, fermentation temperature, and time on umqombothi samples: (A) pH, (B) Viscosity. The models for optimizing the alcohol content (°P), TSS (g/100 g) and pH, in the beer, were found to be significant as implied by high model F-values (F ≥ 10) and low p-values (p ≤ 0.05) (Table 4). For the alcohol content and TSS models, X3, X1X2, X12 were significant model terms (p ≤ 0.05) (Table 4). Significant model terms for pH were X3, X1X3, X32, with p-values of 0.00, 0.047, and 0.00 respectively (Table 4). The predicted determination (pred-R2) values for alcohol content and TSS were not as close to the adjusted determination (adj-R2) indicating a slight limitation with the model (Table 5). A consideration of outliers, model reduction, and response transformation may improve the empirical model[37]. In contrast, the predicted determination (pred-R2) of 0.89 in the pH optimization model was reasonably close to the adjusted determination (adj-R2) of 0.97, thus confirming the model’s accuracy in correctly predicting responses (Table 5). Adequate precision values above 4 indicated an adequate signal-to-noise ratio. This means the optimization models for alcohol content, TSS, and pH were suitable to navigate the design space and all of the model’s parameters showed that the developed models were able to predict the responses correctly. The optimization models for alcohol content, TSS had reproducibility above 90% (R2 ≥ 0.90) and low coefficient of variation (C.V. %) values (Table 5), indicating a good precision for the capability of the process under evaluation.
Table 4

Analysis of variance (ANOVA) for the alcohol content, TSS, TTA, pH, and viscosity quadratic models.

SourceSum of squaresdfMeanF-valuep-value
Alcohol content
Model44.7499.0004.97216.4350.000*
X1–cooking time0.1911.0000.1910.6300.446
X2–fermentation temperature0.0011.0000.0010.0020.964
X3–fermentation time35.5011.00035.501117.3480.000*
X1X22.5091.0002.5098.2930.016*
X1X30.0541.0000.0540.1800.680
X2X30.1401.0000.1400.4640.511
X123.8701.0003.87012.7910.005*
X221.0541.0001.0543.4830.092
X321.0281.0001.0283.3990.095
Residual3.02510.0000.303
Lack of fit2.5365.0000.507
Pure error0.4905.0000.098
Corrected total sum of squares47.77419.000
TSS
Model40.6209.0004.51315.6850.000*
X1–cooking time0.1151.0000.1150.3990.542
X2–fermentation temperature0.1921.0000.1920.6670.433
X3–fermentation time32.5031.00032.503112.9510.000*
X1X22.0301.0002.0307.0550.024*
X1X30.0631.0000.0630.2190.650
X2X30.1711.0000.1710.5950.458
X122.4201.0002.4208.4090.016*
X222.4301.0002.4308.4440.016*
X320.1261.0000.1260.4370.524
Residual2.87810.0000.288
Lack of fit2.5785.0000.516
Pure error0.2995.0000.060
Corrected total sum of squares43.49819.000
TTA
Model0.6789.0000.0750.8190.613
X1–cooking time0.0011.0000.0010.0090.925
X2–fermentation temperature0.0041.0000.0040.0470.833
X3–fermentation time0.2141.0000.2142.3200.159
X1X20.0051.0000.0050.0540.820
X1X30.1011.0000.1011.1000.319
X2X30.0541.0000.0540.5910.460
X120.0701.0000.0700.7610.404
X220.0571.0000.0570.6140.451
X320.2171.0000.2172.3560.156
Residual0.92110.0000.092
Lack of fit0.9125.0000.182
Pure error0.0085.0000.002
Corrected total sum of squares1.59919.000
pH
Model3.7109.0000.41276.1740.000*
X1–cooking time0.0181.0000.0183.2490.102
X2–fermentation temperature0.0161.0000.0162.9720.115
X3–fermentation time1.8251.0001.825337.2270.000*
X1X20.0051.0000.0050.8340.383
X1X30.0281.0000.0285.1030.047*
X2X30.0041.0000.0040.6680.433
X120.0021.0000.0020.4040.539
X220.0001.0000.0000.0320.861
X321.7981.0001.798332.1800.000*
Residual0.05410.0000.005
Lack of fit0.0535.0000.011
Pure error0.0015.0000.000
Corrected total sum of squares3.76419.000
Viscosity
Model165.3849.00018.3762.9940.051
X1–cooking time93.2621.00093.26215.1930.003*
X2–fermentation temperature0.4671.0000.4670.0760.788
X3–fermentation time10.5411.00010.5411.7170.219
X1X24.2631.0004.2630.6940.424
X1X30.5831.0000.5830.0950.764
X2X30.0881.0000.0880.0140.907
X1251.2751.00051.2758.3530.016*
X220.0001.0000.0000.0001.000
X322.0081.0002.0080.3270.580
Residual61.38610.0006.139
Lack of fit59.3995.00011.880
Pure error1.9865.0000.397
Corrected total sum of squares226.77019.000

*Significant at p ≤ 0.05.

Table 5

Fit statistics of the quadratic model for alcohol content, TSS, TTA, pH, and viscosity optimization.

ParametersC.V. %R2Adjusted R2Predicted R2Adequate precision
Alcohol6.8150.9370.8800.52913.942
TSS6.9280.9280.8740.47413.657
TTA30.0560.424–0.094–3.3352.986
pH2.3490.9860.9730.88832.559
Viscosity15.2230.7290.486–1.0457.005

C.V coefficient of variation.

Analysis of variance (ANOVA) for the alcohol content, TSS, TTA, pH, and viscosity quadratic models. *Significant at p ≤ 0.05. Fit statistics of the quadratic model for alcohol content, TSS, TTA, pH, and viscosity optimization. C.V coefficient of variation. The models for optimizing TTA and viscosity were insignificant as implied by low model F-values (F ≤ 10) and high p-values (p > 0.05) (Fig. 2a,b). Here, model reduction, consideration of outliers, and response transformation will not improve the model. The overall mean may be a better predictor of the designed responses than the current models. A higher-order model may also predict better in certain cases. None of the TTA optimization model terms were significant, while X1 and X12 were significant model terms (p ≤ 0.05) for viscosity. Both the models’ limitations were described by significant differences between the predicted determination. The TTA model had a pred-R2 of –3.34 and an adj-R2 of –0.09. Similarly, the model for viscosity had a pred-R2 of –1.05 and an adj-R2 of 0.49. In this case, a negative predicted determinant (pred-R2) implies that the overall mean may be a better predictor of the designed response than the current model[38]. A higher-order model may also predict better in certain cases. An adequate precision value of 2.99 in the TTA model indicated an undesirable signal-to-noise ratio. This means the model was not suitable to navigate the design space. The viscosity optimization model had an adequate precision above 4, meaning the model was suitable for navigating the design space. The low reproducibility of 42% (Table 5) for the TTA optimization model was indicated by a low coefficient of determination (R2 = 0.424). In contrast, the coefficient of determination for the viscosity was 0.729, representing a 73% reproducibility. Although the reproducibility can be considered adequate, a C.V. % value of 15.22 may be alarming (Table 5). From the obtained experimental data, second-order polynomial equations showing the significance of linear, quadratic, and interactive terms in predicting the response were generated and shown in Eqs. (2), (3), (4), (5), and (6) below:where Y1 = response for alcohol content (°P), Y2 = response for TSS (g/100 g), Y3 = response for TTA (% lactic acid), Y4 = response for pH, Y5 = response for viscosity (cm/min), X1 = Cooking time (hr), X2 = Fermentation temperature (°C), X3 = Fermentation time (hr).

The effect of input factors on the physicochemical properties of the optimal beer brew

Independent variables, cooking time (hr) coded as (X1), fermentation temperature (°C) coded as (X2), and time (hr) coded as (X3) were optimized. The optimization goal for all independent variables was set to ‘target’ as dictated by the nature of the study. The responses alcohol content (°P), TSS (g/100 g), TTA (% lactic acid), pH, and viscosity (cm/min) were considered for optimization. The software generated 100 optimization solutions each with a desirability value of 1. To select a suitable solution, prediction values of each solution were compared to prediction values of the constructed ANN. Yeast survival and proliferation, under-and-over cooking, shelf-life associated spoilage, and conditions’ applicability in real-life (study objectives) were also considered. Taking these variables into account, a solution that favored these considerations was selected. A cooking time of 1.1 h, fermentation temperature of 29.3 °C, and fermentation time of 25.9 h were optimal bioprocessing conditions. The parameters (alcohol content, TSS, TTA, pH, and viscosity) were subsequently investigated and the results are provided in Table 6. The customary brew (CB) was prepared by cooking the mixed ingredients for 30 min and leaving the cooked slurry to ferment at 25 °C for 24 h. The CB was then compared with the optimized brew (OPB).
Table 6

Physicochemical properties of umqombothi.

SampleAlcohol (°P)TSS (g/100 g)TTA (% lactic acid)pHViscosity (cm/min)
CB11.33 ± 0.21a10.90 ± 0.10a0.57 ± 0.02a4.23 ± 0.02b16.83 ± 0.76b
OPB13.63 ± 0.12b13.33 ± 0.21b0.68 ± 0.02b3.27 ± 0.03a11.50 ± 0.87a

CB customary brew, OPB optimized brew. Each value is a mean ± standard deviation of triplicates.

*Each value is a mean of triplicates ± standard deviation of triplicates. Means with no common letters within a row significantly differ (p < 0.05).

Physicochemical properties of umqombothi. CB customary brew, OPB optimized brew. Each value is a mean ± standard deviation of triplicates. *Each value is a mean of triplicates ± standard deviation of triplicates. Means with no common letters within a row significantly differ (p < 0.05). The OPB was found to have a low pH (3.27 ± 0.03) compared to the CB (4.23 ± 0.02) (Table 6). As a result, the OPB had a higher alcohol content (13.63 ± 0.12°P) and a higher TTA (0.68 ± 0.02% lactic acid). In preparing high-quality umqombothi, a 60 min cooking time has been suggested to be ideal[39]. A cooking time of 1.1 h did not under-/over-gelatinize the starch and provided adequate nutrients to yeasts cells[24]. In addition, the achieved gelatinization improved water absorption into the granules, thereby improving the viscosity[40]. This was reflected in the viscosity obtained for the OPB, which had more a desirable viscosity value compared to the CB (Table 6). A fermentation temperature of 29.3 °C was optimal for higher production of alcohol in the OPB (Table 6). A higher TSS in the OPB (Table 6) described the type of sugar conversion and its dependence on temperature for a rich, finished beer[41]. The slightly higher fermentation temperature and a relatively short fermentation time in the OPB appeared to improve the overall physicochemical properties of umqombothi. A fermentation time of 25.9 h was ideal for the fermentation rate and final beer.

ANN training, validation, and testing on experimental responses

An appropriate ANN construction involves the selection of network architecture, determination of hidden layers and number of neurons in each layer, learning—training—validation, and verification of the data[18]. In building a better ANN model, the number of the hidden layers between inputs and output must be appropriately trained and fitted[18]. To achieve this, the number of neurons in the hidden was varied (i.e., 5, 10, and 20 neurons in the hidden layer) (data not reported). To further study the responses of the model, three different training algorithms were evaluated. When 10 neurons in the hidden layer were used, all the algorithms rapidly generated solutions with high R and R2 values (data not reported). However, when the neurons were increased to 20, the number of reiterations increased in the BR algorithm, thus taking longer to generate a solution. In contrast, both the LM and SCG algorithms were not significantly affected by an increase or decrease in the number of neurons and maintained a higher rapidity in generating solutions. The SCG uses second-order approximation, resulting in fewer iterations and faster learning[42]. This may be due to the algorithm using a step-size scaling mechanism that avoids a timewasting line search per learning iteration[43,44]. Adequate training, validation, testing, and overall prediction accuracy were observed when the LM algorithm was used (Table 7). The LM algorithm which may be the fastest of the three training algorithms specifically works with loss functions presented in the form of a sum of squared errors (SSE)[45,46]. Unfortunately, LM cannot be applied to the cross-entropy error and the root mean squared error functions[46]. For functioning approximation problems, the LM training algorithm was able to obtain lower MSE than all other algorithms among regularization techniques. As a result, the LM is the recommended choice with better performance in terms of rapidity and the overfitting problem when there are a few thousand instances and a few hundred parameters for training the ANN[46,47]. In an unrelated study, the LM training algorithm was found to show the highest accuracy in comparison to different training algorithms in a MLP model that forecasted chemical elements distribution in the topsoil[45].
Table 7

Training, validation, and testing performance indices.

AlcoholTSSTTApHViscosity
MSERR2MSERR2MSERR2MSERR2MSERR2
Training0.011.001.000.011.000.990.000.980.960.351.001.000.210.990.99
Validation0.420.910.830.090.980.960.010.950.910.021.001.000.500.970.94
Testing0.330.770.600.450.970.930.080.720.521.440.530.284.970.870.76
Overall0.420.980.960.090.980.960.010.900.810.020.710.500.500.960.92

MSE mean squared error, R coefficient of correlation, R coefficient of determination, TSS total soluble solids, TTA total titratable acidity.

Training, validation, and testing performance indices. MSE mean squared error, R coefficient of correlation, R coefficient of determination, TSS total soluble solids, TTA total titratable acidity. The ANN training using the LM algorithm stopped automatically when generalization stopped, indicated by an increase in the MSE of the validation samples. In measuring performance indices of the ANN, the MSE is the most used and simplest error function[48,49]. The MSE measures the ability of the model to predict responses accurately, with a lower MSE showing a higher modeling ability[18]. In combination, R2 and MSE evaluated the overall accuracy of the model[18]. The coefficient of correlation (R) was used to measure the correlation between inputs and targets. R = 1 described a close relationship, and R = 0 described a random relationship. ANN models for alcohol content, TSS, TTA, and viscosity had overall R2 values of 0.96, 0.96, 0.81, and 0.92, respectively (Table 7). These values were closer to 1, suggesting high reliability in model prediction accuracy. The overall R2 value for pH was 0.50 representing a 50% reproducibility. Overall, a high correlation between inputs and targets was observed for alcohol content (0.98), TSS (0.98), TTA (0.90), and viscosity (0.96) (Fig. 3).
Figure 3

Response plots of the ANN: (A) Alcohol content (°P), (B) TSS (g/100 g), (C) TTA (% lactic acid), (D) pH, (E) viscosity (cm/min).

Response plots of the ANN: (A) Alcohol content (°P), (B) TSS (g/100 g), (C) TTA (% lactic acid), (D) pH, (E) viscosity (cm/min). Apart from MSE values, the ANN was further assessed using performance curves. Performance curves display the network’s incremental training process and the direction in which it learns. These curves plot training record error values against the number of training epochs. Consequently, the learning curve is a plot describing a model learning performance over time or experience[50]. Performance curves are useful in diagnosing problems with learning aspects such as unrepresentative training datasets, underfitting models, unrepresentative validation datasets, and overfitting models[50]. The ANN best validation performance curves for the responses are shown in Fig. 4. The ANN achieved the best learning and the lowest error after a few iterations (epochs). The best validation performance for each network was taken from the epoch with the lowest validation error. Both alcohol and TTA had the shortest iterations before achieving the best validation performance. In contrast, TSS achieved its best validation performance at epoch 5. After more epochs of training, the error is generally reduced but may start to increase on the validation dataset as overfitting of the training data occurs[51]. All the networks showed a good learning rate for the training stage and a high learning rate for the validation and testing stages[52]. In addition, both the training and validation showed a good fit displayed by training and validation MSE (loss) values which decreased to a point of stability with relatively nominal gaps between the two final MSE (or loss) values[50]. Overall better learning is described by error scores closer to 0, thus indicating that the training dataset was learned thoroughly and minimal mistakes were made[50].
Figure 4

Validation performance plot of the ANN: (A) Alcohol content (°P), (B) TSS (g/100 g), (C) TTA (% lactic acid), (D) pH, (E) viscosity (cm/min).

Validation performance plot of the ANN: (A) Alcohol content (°P), (B) TSS (g/100 g), (C) TTA (% lactic acid), (D) pH, (E) viscosity (cm/min).

Comparison between the RSM and ANN responses

An optimization prediction model developed by RSM was assessed by comparing its prediction accuracy with that of the ANN which was also used to validate the entire process. Table 8 shows the error comparison obtained from both and ANN predictions. The comparative error analysis was used to verify the prediction accuracy and generalization capacity of both models in optimizing the bioprocess[53,54]. Overall, the ANN model showed lower error values than the RSM, indicating lower computational deviations and an advanced generalization capability[11,54]. As a result, ANN displayed a higher prediction accuracy and better model fitting[18]. On the other hand, RSM prediction values can be accepted with a higher degree of confidence since they are closer to experimental values and ANN prediction values[18,55]. The results from Table 8 show a close correlation between the experimental values and RSM and ANN’s predicted values. Both RSM and ANN models showed a relatively high number of inexact predictions for viscosity.
Table 8

RSM and ANN predictions values.

RunAlcohol (°P)TSS (g/ 100 g)TTA (% lactic acid)`pHViscosity (cm/min)
ExpRSM PredError RSMANN PredError ANNExpRSM PredError RSMANN PredError ANNExpRSM PredError RSMANN PredError ANNExpRSM PredError RSMANN PredError ANNExpRSM PredError RSMANN PredError ANN
18.078.7380.24310.57− 0.107.378.1170.23710.300.001.181.1150.1340.470.032.902.9140.0324.600.0015.3314.1381.09412.85− 0.02
28.708.4240.2248.720.017.737.8080.2198.87− 0.501.201.1840.1241.02− 0.242.882.9010.0303.00− 0.0114.5015.2441.01014.50− 0.33
37.778.8730.4507.000.077.378.7320.4397.030.001.071.0230.2480.73− 0.052.913.4750.0602.810.0010.8315.1912.02914.47− 0.30
56.776.0720.4507.86− 0.096.636.1110.4397.370.000.810.8320.2481.050.022.902.8180.0602.910.0017.1714.1802.02810.93− 0.10
45.105.3920.4505.790.984.905.2200.4396.630.000.810.8240.2480.810.002.952.8830.0602.900.0019.1719.0262.02815.112.06
69.506.9130.4509.490.019.276.5890.4399.270.001.540.9830.2481.130.413.622.9760.0603.620.0022.6712.8802.02821.271.40
77.0010.2440.4508.50− 0.436.709.7950.4397.71− 0.340.720.8430.2481.20− 0.022.903.5490.0602.900.0025.0014.3102.02915.55− 0.22
810.479.9220.4505.77− 0.0410.309.7340.4395.600.000.501.2140.2480.77− 0.034.603.7120.0602.85− 0.0112.8319.4972.02915.28− 2.95
94.706.7910.4505.030.074.806.7570.4394.92− 0.020.720.5730.2480.99− 0.183.262.8200.0602.950.0016.5017.4072.02819.22− 0.05
108.738.7040.2338.500.208.378.2870.2287.710.020.781.1460.1291.200.002.992.9220.0312.90− 0.0214.1714.7611.05115.55− 1.05
118.878.4400.22110.02− 0.857.907.9270.2158.930.001.211.1570.1221.430.012.922.8810.0303.720.0116.1714.6150.99520.86− 0.03
128.139.3330.2648.500.377.878.9530.2587.710.191.131.1700.1461.200.012.913.2750.0352.900.0215.6714.9261.19015.550.62
138.439.4480.2417.32− 0.327.878.8700.2356.700.001.221.1130.1330.80− 0.082.903.3240.0322.890.0115.0014.6451.08425.28− 0.28
149.509.7290.2508.50− 0.379.609.1780.2447.710.160.791.1580.1381.20− 0.072.863.4890.0332.90− 1.9917.3316.2161.12515.550.12
157.076.6850.2578.50− 0.077.036.3180.2517.710.160.680.9990.1421.200.022.812.8480.0342.900.0014.1713.4871.15715.55− 0.55
169.279.0640.2209.480.029.038.4880.2159.600.001.351.2020.1210.98− 0.193.363.1160.0292.860.0014.3315.0470.99217.39− 0.06
179.176.8700.2354.480.228.936.4620.2294.800.001.441.0020.1290.690.033.732.8400.0313.030.2320.8313.6511.05716.70− 0.20
188.379.4660.23710.19− 0.128.108.9560.2319.500.001.251.2380.1311.06− 0.012.883.4820.0323.89− 0.3814.6717.3161.06916.86− 0.03
1910.077.0760.2368.50− 0.139.506.6120.2307.710.391.051.0540.1301.200.053.512.8250.0322.90− 0.0216.8313.6231.06115.55− 0.88
205.738.8130.2409.48− 0.215.608.3140.23510.13− 1.100.741.1340.1331.350.002.843.1130.0322.880.4812.3314.7041.08313.131.20

cm centimetre, Exp experimental, g gram, hr hour, min minute, ml millimetre, Pred predicted, TSS total soluble solids, TTA total titratable acidity.

RSM and ANN predictions values. cm centimetre, Exp experimental, g gram, hr hour, min minute, ml millimetre, Pred predicted, TSS total soluble solids, TTA total titratable acidity. The difference between predicted and experimental values directly contributed to the extent deviation in predictive capacity of each model. While RSM is recommended for modelling new processes, its sensitivity may be limited[55]. Despite this limitation, RSM has an obvious way of showing the effect of individual elements and their interactions on a specific system[11]. For example, the effect on a specific parameter is shown by a greater higher value of coefficients in ANOVA[57]. On the other hand, a higher number of inputs are required for ANN than RSM to have better predictions[55]. ANN cannot give such insights into the system directly since it is a ‘black box’[56]. Nonetheless, ANN can universally describe high-level interactions in non-linear systems without prior specification for suitable fitting function[55,57]. Additionally, ANN can calculate multi-responses in a single process[53]. As depicted by the close agreement between the experimental and predicted values, RSM and ANN are adequate for developing a bioprocess that optimally produces umqombothi. Advanced soft computing approaches like ANN may be preferred in the case of data sets with a limited number of observations in which regression models fail to capture reliably[18]. The closeness of the experimental values and predicted suggest that the non-linear fitting effects of the model are good, recommending the use of the proposed procedure[18,57]. A coupled modeling approach can thus be applied in bioprocess method development for umqombothi and related variations.

Conclusion

Both RSM and ANN were effective bioprocess development tools that facilitated the optimization of umqombothi. The effectiveness of RSM was shown by R2 closer to 1. The R2 values were 0.94, 0.93, 0.99, and 0.73 for alcohol, TSS, pH, and viscosity respectively, showing reliability and reproducibility above 70%. Similarly, ANN displayed a high degree of accuracy. Constructed ANN models for alcohol, TSS, TTA, and viscosity had R2 values of 0.96, 0.96, 0.81, and 0.92 respectively. As result, a good correlation between the experimental and predicted values suggests that a coupled approach may positively impact the bioprocess and the final product. However, further investigation of other key parameters (i.e., starter culture, the content and ratio of raw materials, souring time and temperature, and cooking temperature) is still required. Furthermore, the use of an additional tool such as genetic algorithm may resolve computational and modeling limitations.
  7 in total

Review 1.  Artificial neural networks: fundamentals, computing, design, and application.

Authors:  I A Basheer; M Hajmeer
Journal:  J Microbiol Methods       Date:  2000-12-01       Impact factor: 2.363

2.  Physicochemical properties of sorghum flour are selectively modified by combined germination-fermentation.

Authors:  Abd Elmoneim O Elkhalifa; Rita Bernhardt; Gaetano Cardone; Alessandra Marti; Stefania Iametti; Mauro Marengo
Journal:  J Food Sci Technol       Date:  2017-08-22       Impact factor: 2.701

3.  Kinetic modelling and optimisation of antimicrobial compound production by Candida pyralidae KU736785 for control of Candida guilliermondii.

Authors:  Maxwell Mewa-Ngongang; Heinrich W du Plessis; Ucrecia F Hutchinson; Lukhanyo Mekuto; Seteno Ko Ntwampe
Journal:  Food Sci Technol Int       Date:  2017-01-01       Impact factor: 2.023

4.  Mycobiota and co-occurrence of mycotoxins in South African maize-based opaque beer.

Authors:  Ifeoluwa Adekoya; Adewale Obadina; Cynthia Chilaka Adaku; Marthe De Boevre; Sheila Okoth; Sarah De Saeger; Patrick Njobeh
Journal:  Int J Food Microbiol       Date:  2018-02-09       Impact factor: 5.277

5.  Improving the Quality of Response Surface Analysis of an Experiment for Coffee-Supplemented Milk Beverage: I. Data Screening at the Center Point and Maximum Possible R-Square.

Authors:  Sungsue Rheem; Sejong Oh
Journal:  Food Sci Anim Resour       Date:  2019-02-28

6.  Artificial Neural Networks (ANNs) and Response Surface Methodology (RSM) Approach for Modelling the Optimization of Chromium (VI) Reduction by Newly Isolated Acinetobacter radioresistens Strain NS-MIE from Agricultural Soil.

Authors:  Nur Syuhadah Ram Talib; Mohd Izuan Effendi Halmi; Siti Salwa Abd Ghani; Uswatun Hasanah Zaidan; Mohd Yunus Abd Shukor
Journal:  Biomed Res Int       Date:  2019-05-26       Impact factor: 3.411

7.  Impact of changes in fermentation time, volume of yeast, and mass of plantain pseudo-stem substrate on the simultaneous saccharification and fermentation potentials of African land snail digestive juice and yeast.

Authors:  P U Amadi; M O Ifeanacho
Journal:  J Genet Eng Biotechnol       Date:  2016-10-07
  7 in total
  1 in total

1.  Physicochemical, structural analysis of coal discards (and sewage sludge) (co)-HTC derived biochar for a sustainable carbon economy and evaluation of the liquid by-product.

Authors:  Gentil Mwengula Kahilu; Samson Bada; Jean Mulopo
Journal:  Sci Rep       Date:  2022-10-20       Impact factor: 4.996

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.