Aida Alizamir1, Amin Gholami2, Nader Bahrami3, Mehdi Ostadhassan4,5,6. 1. Department of Pathology, School of Medicine, Hamadan University of Medical Science, Hamadan 6517838738, Iran. 2. Reservoir Division, Iranian Offshore Oil Company, Tehran 1966653943, Iran. 3. Financial Transaction Department, Carsome Company, Petaling Jaya, Selangor 47800, Malaysia. 4. Department of Geology, Ferdowsi University of Mashhad, Mashhad 9177948974, Iran. 5. Institute of Geosciences, Marine and Land Geomechanics and Geotectonics, Christian-Albrechts-Universität, Kiel 24118, Germany. 6. Key Laboratory of Continental Shale Hydrocarbon Accumulation and Efficient Development, Ministry of Education, Northeast Petroleum University, Daqing 163318, China.
Abstract
Hemoglobin is one of the most important blood elements, and its optical properties will determine all other optical properties of human blood. Since the refractive index (RI) of hemoglobin plays a vital role as a non-invasive indicator of some illnesses, accurate calculation of it would be of great importance. Moreover, measurement of the RI of hemoglobin in the laboratory is time-consuming and expensive; thus, developing a smart approach to estimate this parameter is necessary. In this research, four viable strategies were used to make a quantitative correlation between the RI of hemoglobin and its influencing parameters including the concentration, wavelength, and temperature. First, alternating conditional expectations (ACE), a statistical approach, was employed to generate a correlation to predict the RI of hemoglobin. Then, three different optimized intelligent techniques-optimized neural network (ONN), optimized fuzzy inference system (OFIS), and optimized support vector regression (OSVR)-were used to model the RI. A bat-inspired (BA) algorithm was embedded in the formulation of intelligent models to obtain the optimal values of weights and biases of an artificial neural network, membership functions of the fuzzy inference system, and free parameters of support vector regression. The coefficient of determination, root-mean-square error, average absolute relative error, and symmetric mean absolute percentage error for each of the ACE, ONN, OFIS, and OSVR were found as the measure of each model's accuracy. Results showed that ACE and optimized models (ONN, OFIS, and OSVR) have promising results in the estimation of hemoglobin's RI. Collectively, ACE outperformed ONN, OFIS, and OSVR, while sensitivity analysis indicated that the concentration, wavelength, and, lastly, temperature would have the highest impact on the RI.
Hemoglobin is one of the most important blood elements, and its optical properties will determine all other optical properties of human blood. Since the refractive index (RI) of hemoglobin plays a vital role as a non-invasive indicator of some illnesses, accurate calculation of it would be of great importance. Moreover, measurement of the RI of hemoglobin in the laboratory is time-consuming and expensive; thus, developing a smart approach to estimate this parameter is necessary. In this research, four viable strategies were used to make a quantitative correlation between the RI of hemoglobin and its influencing parameters including the concentration, wavelength, and temperature. First, alternating conditional expectations (ACE), a statistical approach, was employed to generate a correlation to predict the RI of hemoglobin. Then, three different optimized intelligent techniques-optimized neural network (ONN), optimized fuzzy inference system (OFIS), and optimized support vector regression (OSVR)-were used to model the RI. A bat-inspired (BA) algorithm was embedded in the formulation of intelligent models to obtain the optimal values of weights and biases of an artificial neural network, membership functions of the fuzzy inference system, and free parameters of support vector regression. The coefficient of determination, root-mean-square error, average absolute relative error, and symmetric mean absolute percentage error for each of the ACE, ONN, OFIS, and OSVR were found as the measure of each model's accuracy. Results showed that ACE and optimized models (ONN, OFIS, and OSVR) have promising results in the estimation of hemoglobin's RI. Collectively, ACE outperformed ONN, OFIS, and OSVR, while sensitivity analysis indicated that the concentration, wavelength, and, lastly, temperature would have the highest impact on the RI.
Hemoglobin is a vital
protein in human blood cells that is responsible
for carrying oxygen between the lungs and the rest of the tissues
in the human body. This process will allow aerobic respiration to
provide energy to the cells for their metabolism.[1,2] There
are two distinct forms of hemoglobin, that is, hemoglobin exist in
oxygenated (O) and deoxygenated (DO) states based on their capability
to reversibly bind up to four oxygen molecules.[1] Besides, hemoglobin is a biological marker to clinically
diagnose different medical conditions mainly because of the close
relationship between the pathophysiology of various diseases and the
red blood cells.[3−6] Considering the shape and size of hemoglobin, its optical properties
would significantly affect the optical attributes of the entire blood.[7] Thus, optical properties of hemoglobin can provide
us with valuable information which is extremely useful in medical
diagnosis[4] and the patient’s overall
health conditions. At the same time, its role in therapeutic applications,
particularly laser medicine, is highly valuable. Moreover, the RI
of hemoglobin, which can be found through optical methods in the laboratory
using a refractometer, can reveal the true nature of the blood cell
disorder.[8−10] However, experimental measurement of this parameter
in the laboratory is highly expensive, labor-intensive, and time consuming.
This has promoted the development of empirical models based on statistical
approaches instead to obtain the RI more rapidly,[11] while many of such models have demonstrated a poor performance.Based on what was said above, machine learning (ML), which has
been proven to be an efficient method to replace common statistical
approaches and a new modeling tool in solving complicated regression
and classification problems, could be used to estimate the RI of hemoglobin
as well. In this realm, in the classification task, a number of studies
were conducted to successfully classify the blood cells.[12−17] Tomari et al.(2014) used an artificial neural network (ANN) to classify
red blood cells as normal/abnormal.[12] Kultu
et al. (2020) proposed a convolutional neural network (CNN) to identify
and locate white blood cell types in blood images, which led to an
increase in the performance of existing blood test devices.[13] Toğaçar et al. (2020) adopted
CNN models to improve the classification success of white blood cell
types.[14] To address the issue of multiple
cell overlap in white blood cell image classification, Patil et al.
(2021) combined the deep learning approach (merging of the CNN and
recurrent neural network model) with canonical correlation analysis.[15] Girdhar et al. (2022) also employed the CNN
model to classify white blood cell.[16] Davamani
et al. (2022) developed fuzzy c-means clustering for blood cell classification.[17] In addition, others have developed AI-based
models to segment blood vessels.[18−25] Wang et al. (2015) integrated CNN and random forest (RF) for retinal
blood vessel segmentation.[19] Soomro et
al. (2019) utilized deep CNN for segmenting retinal blood vessels.[20] In their paper, in order to generate contrast
images for training data sets, morphological mappings along with the
principal component analysis-based pre-processing steps were used.
Zhang et al. (2020) used a U-net-based deep learning approach to track
and segment brain blood vessels in digital subtraction angiography
images.[21] Tchinda et al. (2021) proposed
a potent strategy based on classical edge detection filters and ANNs
for the segmentation of blood vessels in retinal photographs.[22] In their paper, edge detection filters were
first applied to extract the feature vector. Then, the resulting features
were used to train an ANN in order to recognize each pixel as belonging
to blood vessels or not. Gegundez-Arias et al. (2021) developed a
new deep learning method for blood vessel segmentation in retinal
images based on convolutional kernels and a modified U-net model.[23] Deng and Ye (2022) performed retinal blood vessel
segmentation based on an improved deformable convolutional M-shaped
network and a pulse-coupled neural network.[24] Zhang et al. (2022) presented a novel automatic method based on
bridge-net by joint learning context-involved and non-context features
for the segmentation of retinal blood vessels.[25] In the regression task, researchers have attempted to develop
intelligence-based approaches to reliably predict blood pressure.[26−31] Xu et al. (2017) presented a capable methodology based on ANN for
continuous blood pressure estimation based on multiple parameters
from electrocardiogram and photoplethysmogram.[26] Senturk et al. (2020) constructed a predictive model based
on dynamic recurrent neural networks to estimate non-invasive continuous
cuffless blood pressure.[27] Esmaelpoor et
al. (2020) proposed a two-step strategy for blood pressure estimation
using photoplethysmogram signals.[28] In
the first stage, they employed CNN to extract morphological features
from each photoplethysmogram segment and then to estimate systolic
and diastolic blood pressure separately. In the second stage, they
used long short-term memory (LSTM) to capture temporal dependencies.
Baker et al. (2021) applied a hybrid neural network for continuous
and non-invasive estimation of blood pressure from raw electrocardiogram
and photoplethysmogram waveforms.[29] Qiu
et al. (2021) proposed a hybrid neural network architecture, which
contained a CNN-Sequential-Adapt layer, a ResNet25_BP layer with squeeze
and excitation blocks and fully connected layers, for blood pressure
estimation.[30] Cheng et al. (2021) employed
fully CNN for prediction of arterial blood pressure waveforms from
photoplethysmogram signals.[31] Others attempted
to carefully evaluate the applicability of ML in the estimation of
blood glucose levels.[32−36] Ben Ali et al. (2018) used ANN for continuous blood glucose level
prediction of type 1 diabetes.[32] D’Antoni
et al. (2020) constructed an auto-regressive time-delayed jump neural
network for blood glucose level prediction.[33] Alfian et al. (2020) applied ANN, support vector regression, K-nearest
neighbor, decision tree, RF, adaptive boosting, and extreme gradient
boosting models for blood glucose prediction of type 1 diabetes.[34] They compared results of predictive models and
concluded that ANN outperforms other intelligence-based models. Dudukcu
et al. (2021) predicted blood glucose by virtue of deep neural networks.
LSTM, WaveNet, and gated recurrent units, and decision-level combinations
of these architectures were deep learning methods which were used
to predict blood glucose.[35] Zhang et al.
(2021) adopted deep learning and regression approaches to forecast
blood glucose levels for type 1 diabetes.[36] Four data-driven models including different neural network architectures,
a reservoir computing model, and a linear regression approach were
employed in their study. Considering the RI estimation of hemoglobin,
AI methods have shown some promising results compared to common statistical
approaches.[37,38] All in all, these studies confirm
the applicability of ML and AI methods in the analysis of blood from
various perspectives, while the estimation of RI, despite its importance,
has been rarely done via intelligent techniques.In this study,
four techniques are presented to estimate the RI
of hemoglobin. To do so, alternating conditional expectation (ACE)
was first employed to carry out a quantitative estimation of the RI
of hemoglobin. This method correlates inputs to outputs through optimally
transferring dependent and independent variables into high-dimensional
space. Next, three optimized models, viz., the optimized neural network
(ONN), optimized fuzzy inference system (OFIS), and optimized support
vector regression (OSVR) were used to model the RI. In neural network
equations, a bat-inspired algorithm (BA) was embedded to achieve global
minima (optimal value of weight and bias), where the neural network
can operate at its maximum capacity. In the fuzzy inference system,
BA was employed to optimize membership functions of a fuzzy inference
system established in the structure of the Sugeno fuzzy inference
system type. Moreover, BA was involved in the support vector regression
equations to improve its precision by means of determining the optimal
values of free parameters. Finally, sensitivity analysis (SA) was
performed to obtain information related to the contribution of each
input in every model. This SA will enable us to understand how each
parameter would affect the outputs of every model.
Model Description
Alternating Conditional
Expectations
Recently, Breiman and Freidman formulated a
nonparametric approach
known as ACE for solving complex regression problems.[39] When the functional form between dependent variables and
independent ones is implicit, this method is far superior to its rivals
in identifying principal nonlinear relationships of input/output space.[40−43] This exceptional feature motivated scientists to use ACE in order
to find solutions to complicated problems which were impossible to
be solved by the virtue of conventional regression methods.[39] A full description of ACE can be found in the
original paper of Breiman and Freidman,[39] but in general a linear regression model for p independent
variables X1, X2,......, Xp and a response variable Y can be mathematically shown as follows.[39]In the above
equation, regression coefficients
(δ, i = 0–p) and the error term (ε) are determined during the
regression analysis. Compared to the linear regression method, ACE
transforms dependent and independent variables into a high-dimensional
space and solves the following regression equation in those spaces.[39]where ϑ(Y) and ξ1(X1),....,ξp(Xp) are
arbitrary measurable mean-zero functions of the Y, X1, X2,...., Xp, respectively. For achieving the best model
via ACE, optimal transformation (OT) ξ1*(Xi), i = 1,...,p and ϑ*(Y) which changes the
dimension of the data input/output space must be extracted. In this
desired space, there is a maximum correlation between transformed
dependent variables and the sum of the transformed predicted variables.
With the above objective in consideration, the following equation
must be minimized.[39]In order to minimize
the above equation,
Breiman and Freidman took
advantage of a series of single-function minimizations and proposed
the following equations for the response variable and predictor, respectively:[39]By iteratively changing the values
of the eqs and 5, optimal transformation
ξ1*(X), i = 1,...,p, and ϑ*(Y), where the value of
ε2 in eq has the minimum values, will be extracted. In the desired space,
the response value is related to the predictor variables with the
following equation.[39]Here, e*
is an undesirable error that
ACE is unable to capture.[39]
Optimized Neural Network
A neural
network is an algorithm that is based on biological neural networks
of the human brain, and it has been proven useful for modeling intricate
regression tasks.[44] Exactly analogous to
the human brain, neural networks are incredibly intelligent to effectively
train from a number of observed data in the absence of an initially
defined mathematical relationship between dependent and independent
variables. Many scientists and engineers have recently been applying
this mathematical technique to find out the complicated relationships
of the input/output space.[45−49] Considering its capability, it is a prospective candidate to find
solutions for nonlinear problems.[50−53] Three layers are defined in the
structure of a neural network, including input, hidden, and output
layers. The input layer is devoted to receiving the input data, while
output layers are used to generate the final result. The hidden layer
is employed to obtain the dependence of output data on the input data.
For the training task, this approach calculates the value of output
from the input data through the entire network and then determines
the difference between estimated values and the corresponding observed
values. Subsequently, this difference is propagated backward through
the network as an error, thereby adjusting weights and biases. The
training phase is considered fulfilled when the error reaches a minimum,
which yields the best predictive model. Although the neural network
is a promising and applicable technique to model nonlinear problems,
it may pose some challenges in distinguishing the global minima from
the local minima and can produce misleading results.[54−57] To effectively solve this issue, scientists have improved the performance
of neural networks through hybridization of optimization algorithms
and this intelligent model.[58−60] Considering this solution, in
this research, the structure of neural networks incorporates BA to
find out optimal values of their weights and biases where the problem
is preferably trapped in the global minima. A schematic diagram of
the BA optimally determining the weights and the bias of a neural
network is illustrated in Figure . In this study, ONN was generated via MATLAB programming.
Figure 1
Schematic
diagram of the BA optimally determining the weights and
the bias of the neural network.
Schematic
diagram of the BA optimally determining the weights and
the bias of the neural network.
Optimized Fuzzy Inference System
A fuzzy
inference system is a methodology established from Zadeh’s
fuzzy set theory, which has been proved notably practical in modeling
nonlinear systems.[61] This approach enables
scientists with mathematical computation to handle uncertainty in
accordance with “degrees of truth” in lieu of “true
or false,” which is generally carried out in the logic of the
calculation. In this approach, each membership function represents
a fuzzy set. A fuzzifier, an inference engine (or a fuzzy rule base),
and the defuzzifier are three principal constituents of fuzzy inference
systems. The fuzzifier employs a membership function to map a set
of data inputs to intervals of 0 and 1. The purpose of the inference
engine as a second part of the fuzzy inference system is to apply
fuzzy rules to the converted output data of the fuzzifier. Outputs
of the inference engine are integrated into one fuzzy output distribution
and eventually transferred into a crisp output by virtue of the output
membership function in the defuzzifier process.[62] In the current work, the Gaussian membership function is
used as an input, and a linear polynomial function is employed as
an output membership function. The two most capable types of fuzzy
inference systems are Mamdani and Sugeno, where Sugeno is utilized
to perform the modeling task in this study. The Sugeno model is composed
of “if–then” rules in the following general form[62]where n refers to the number
of the rules; I is the
input of the fuzzy inference system; Mnm is the Gaussian input membership function of mth input data and nth rule, and Z = f (I) is a function in the series. As mentioned above, the output
membership function f (I) is a linear polynomial in the input variables.
The output level, Z,
is weighted by the firing strength for each rule, and finally, the
overall output is estimated via a weighted average operator. One of
the most important tasks in modeling a regression problem is to find
the optimum values of membership functions since these values play
a critical role in the accuracy of the generated model.[63,64] Therefore, to increase the efficiency of the fuzzy inference system
in mapping the functional dependency between RI and its input variables,
both BA and the fuzzy inference system were integrated in this study
to reach the best values for the membership function. The main idea
of implementing such a combination is presented in Gholami et al.[56] In the current paper, OFIS was built in the
MATLAB environment.
Optimized Support Vector
Regression
Support vector regression, which was proposed
by Vapnik, is a type
of intelligent model based on statistical theory to drive equations
for relating input variables and outputs.[65] The estimation approach in this function is a refinement of a support
vector machine to solve knotty regression tasks. Recently, this method
has attracted much attention because it showed significant ability
in solving diverse regression problems with excellent accuracy.[66−69] In comparison to neural networks, this technique is superior in
generalization domains which originated from how it minimizes the
risk. In this regard, support vector regression applies the structural
risk minimization principle to reduce the upper bound on expected
risk, while neural networks use the empirical risk minimization principle
to decrease the error on the training data.[70] In order to transfer nonlinear learning problems into linear ones,
support vector regression utilizes kernel function in its formulation.
Based on the literature, four dominant kernel functions that are generally
used for this transformation are linear, polynomial, radial basis
function, and sigmoid. However, it has been found that the radial
basis function is a better kernel to be used in regression problems.
Since the influence of penalty parameters of support vector regression
on the performance of a constructed model is greatly high, it is considered
as one of the most challenging issues while training the model.[71−83] Thus, finding the values of these parameters plays a key role in
successfully applying this method for solving knotty problems. In
this study, BA was combined with support vector regression to improve
its efficiency by calculating the optimum value of free parameters
while OSVR was constructed in the MATLAB environment.
Bat-Inspired Algorithm
Yang developed
the BA as the most modern meta-heuristic algorithm with the ability
to perform a global search to manage different sets of optimization
problems.[84] The purpose of this method
is to find the best solution to the optimization problem by simulation
of bats’ behavior in finding food and prey. Bats use their
hearing ability to estimate the location of their surrounding matters,
which is the foundation of establishing the BA method.[84] Many scientists conducted research to compare
the simplicity, speed, and ability of BA with other optimization tasks.[74,78] Results show that BA performs better than other approaches. This
is due to the method’s excellent potential to substantially
decrease the error encountered with the estimation.[78] In this paper, BA was used as an optimization algorithm
to enhance the efficiency of the intelligent models by finding the
optimal values of those parameters.
Data Input/Output
Space
To build and
evaluate the predictive models in this paper, experimental
data of hemoglobin’s RI of humans are obtained from Yahya and
Saghir.[11] They explain that since it is
difficult to keep real human blood samples intractable (a condition
when the samples have homogeneous hemoglobin concentration and optical
stability), other samples resembling the blood from freeze-dried human
hemoglobin powder dissolved in phosphate buffered saline (PBS) were
utilized as well. To get blood solutions, a specific mass of lyophilized
blood was dissolved in PBS which acted as a solvent. PBS keeps the
pH of the blood solutions at 7.4, which is very important since the
RI is affected by changes in the pH. A precise Peltier thermostat
and a multiwavelength Anton Paar WR refractometer were used to measure
the temperature variations and the RI of human blood samples of different
wavelengths (436.1–657.2 nm), respectively. Snell’s
law is the basic principle of the device, and it specifies the critical
angle of the total internal reflection of the investigated sample.
A sensor array was employed to detect the projected beam, so the RI
of blood samples can be calculated. In fact, three selected features
of concentration, wavelength, and temperature were considered as input
data to construct the correlation.[11] It
is important to mention that 80% (384 sample points) of the data points
were selected to train the model and the remaining 20% (96 sample
points) as the test data set to examine the performance of the constructed
model. In addition to this data set, two other data sets from Zhernovaya
et al.[85] were utilized to check the validation
of the developed models. Each data set has 126 data points. In Zhernovaya
et al.,[85] Sigma-Aldrich’s human
hemoglobin (lyophilized powder) was used to obtain experimental results.
Similar to Yahya and Saghir,[11] PBS is employed
to dissolve human hemoglobin to keep it at a pH of 7.4 and to avoid
changes in the RI from pH variations. To measure RI, the digital multiwavelength
refractometer DSR-λ was employed where the RI was obtained by
the measurements of the angle of total internal reflection.[11] Measurement was conducted for solutions of DO
and O hemoglobin, which were obtained from adding sodium dithionite
and sodium bicarbonate, respectively, to all samples. Statistical
analysis data for input and output parameters, including minimum,
maximum, and mean value, mode, standard deviation, skewness, kurtosis,
and coefficient of variation of collected data, is tabulated in Table S1 of Supporting Information.
Results and Discussion
In this study,
the coefficient of determination (R2),
mean square error (MSE), average absolute relative
error (AARE), and symmetric mean absolute percentage error (SMAPE)
were used to make decisions about the accuracy of the developed models.
Their expressions are given in eqs –11 as follows:Coefficient of determination (R2)[54]MSE[54]AARE[54]SMAPE[79]where Yi obs is the measured value of sample i, Yi pred is the predicted
value of sample i, Ŷobs is the average of the measured
value, and n is the number of samples. When the values
of AARE, SMAPE, and MSE are close to 0, as well as the value of R2 is close to 1, a model with superb performance
is achieved. It is worth mentioning that in the training stage of
the constructed optimized models (ONN, OFIS, and OSVR), a fourfold
cross-validation technique was employed to train intelligence-based
models and generate predictive models which produce stable results.
ACE Results
The ACE method was used
to generate a model to quantitatively estimate the RI of hemoglobin.
To achieve this, ACE optimally transfers input parameters and target
values to a high-dimensional space. Figure illustrates the optimal transformations
for input and output variables of training data points (384 data points).
In this high-dimensional space, transformed dependent variables and
the sum of transformed predicted variables have a maximum correlation.
Based on Figure ,
we can understand how the optimal transformation of each variable
is related to its value.
Figure 2
Optimal transformation of (a) concentration,
(b) temperature, (c)
wavelength, and (d) RI. Via curve fitting, equations that relate the
value of optimal transformation of each parameter with its value were
mathematically obtained.
Optimal transformation of (a) concentration,
(b) temperature, (c)
wavelength, and (d) RI. Via curve fitting, equations that relate the
value of optimal transformation of each parameter with its value were
mathematically obtained.To construct ACE models,
equations relating each
independent parameter
to its transformation must be carefully extracted. In this study,
this task was performed through a curve-fitting toolbox in the MATLAB
environment. In this study, developed equations which elicit identical
transformed values of each parameter are simple polynomial expressions.
These mathematical equations for independent parameters (concentration
(C), wavelength (W), and temperature
(T)) are given as followswhere L is the value of non-transformed
input of a model and Tr is the optimal transformation of the L parameter.
Coefficients that were determined through a curve-fitting tool and
were extracted for calculating optimal transformations of each independent
variable are tabulated in Table .
Table 1
Polynomial Coefficients for Determining
Optimal Transformation of Each Input Variable
Relationship
between the estimated RI and the sum of the transformed
input variables. This linear function leads to the calculation of
hemoglobin’s RI from optimal transformation of independent
variables.
Relationship
between the estimated RI and the sum of the transformed
input variables. This linear function leads to the calculation of
hemoglobin’s RI from optimal transformation of independent
variables.In this step, the equation which
relates the value
of RI and the
summation value of the input’s optimal transformations must
be extracted. This task was done via the curve-fitting toolbox in
the MATLAB environment. As shown in this graph, RI has a linear dependency
on the sum of input’s optimal transformations. The following
equation, which relates RI to the sum of the input’s optimal
transformations, was derived through curve fittingThrough this relationship,
the equation
that relates the value
of RI to the summation of optimal transformations was determined.
Substituting eqs and 13 in eq , the correlation that relates RI to input parameters can
be developed as followsThe coefficients
of the above equation
are given in Table .
Table 2
Polynomial Coefficients of the Archived
ACE Model for Estimation of RI
parameter
value
α
0.000153666146350
β
–0.000101861894337
δ
–0.000162023650231
λ
0.000000110738826
μ
1.421225072812270
This equation represents how input parameters
can
be used for the
prediction of hemoglobin’s RI. In this stage, test data (84
data points) and validation data (126 data points of DO hemoglobin
and 126 data points of O hemoglobin) were fed into eq , and the values of RI for these
data points were obtained. Figure a demonstrates the cross-plot between actual RI and
the RI values, which were estimated by the ACE approach for training,
testing, and validation data points. As can be seen in this figure,
ACE results have a perfect fit with observed data.
Figure 4
Cross-plot of estimated
vs measured values for (a) ACE, (b) ONN,
(c) OFIS, and (d) OSVR. This figure was plotted for training, testing,
and two validation data sets. Based on this figure, it can be concluded
that constructed models have acceptable fitness with experimental
data.
Cross-plot of estimated
vs measured values for (a) ACE, (b) ONN,
(c) OFIS, and (d) OSVR. This figure was plotted for training, testing,
and two validation data sets. Based on this figure, it can be concluded
that constructed models have acceptable fitness with experimental
data.For a better comparison, in addition
to Figure a, the correspondence
of predicted RI with
actual RI is assessed in each sample number (Figure ). It is completely evident that calculated
RI has an exact match with the experimental data.
Figure 5
ACE-predicted RI in each
sample number for (a) testing data, (b)
validation (O), and (c) validation (DO). It is observed that estimated
RI provides a good match with the measured values.
ACE-predicted RI in each
sample number for (a) testing data, (b)
validation (O), and (c) validation (DO). It is observed that estimated
RI provides a good match with the measured values.The results of this correlation were carefully
assessed based on
the statistical criteria defined in the previous part, and their results
are listed in Table .
Table 3
Statistical Evaluation of Constructed
Models
model
allocation
R2
MSE
AARE
SMAPE
ACE
training
0.9982336306
0.0000001065
0.0001908349
0.0000954193
testing
0.9982064604
0.0000001291
0.0002052358
0.0001026208
validation (O)
0.9717864827
0.0000016910
0.0007301520
0.0003652859
validation (DO)
0.9743329323
0.0000015765
0.0007440273
0.0003722085
ONN
training
0.9952394041
0.0000002871
0.0003215436
0.0001607715
testing
0.9941302810
0.0000004224
0.0003684964
0.0001842561
validation (O)
0.9499491259
0.0000029999
0.0009546467
0.0004777142
validation (DO)
0.9410967048
0.0000036180
0.0009860192
0.0004934819
OFIS
training
0.9941736595
0.0000003514
0.0003561574
0.0001780796
testing
0.9888749901
0.0000008006
0.0004202847
0.0002102310
validation (O)
0.9677079225
0.0000019355
0.0007854747
0.0003929896
validation (DO)
0.9686669983
0.0000019245
0.0008053153
0.0004029079
OSVR
training
0.9944979014
0.0000003318
0.0003317496
0.0001658630
testing
0.9946500649
0.0000003850
0.0003705688
0.0001852698
validation (O)
0.9493983049
0.0000030329
0.0010064166
0.0005035914
validation (DO)
0.9354325863
0.0000039659
0.0010239630
0.0005124898
The output of statistical evaluation
in all allocation
of data
(training, testing, validation (O), and validation (DO)) confirms
that ACE has a very good performance in modeling hemoglobin’s
RI and therefore should be considered for quantitative estimation
of RI.
ONN Results
In this part, a neural
network combined with BA was used to build a model to determine the
value of hemoglobin’s RI. Training data points (384 data points)
were used in the training stage to obtain the parameters of ONN. BA
successfully keeps neural networks away from being trapped in local
minima instead of global minima. Hence, ONN was developed based on
the oversight to train the models. BA regulation parameters for ONN
are given in Table .
Table 4
BA Regulation Parameters for Optimized
Models
value
parameter
ONN
OFL
OSVR
number of variables for
optimization
26
200
3
population size
100
400
10
maximum iteration
500
500
500
A0
0.5
0.5
0.5
r0
0.5
0.5
0.5
fmin
0
0
0
fmax
2
2
2
In the constructed model,
a hyperbolic tangent sigmoid
was considered
as a mapping function in hidden and output layers. Figure a depicts the process of BA
running for optimizing the proposed neural network, leading to optimal
calculation of weights and biases of the neural network.
Figure 6
BA runs for
optimization of (a) ONN, (b) OFIS, and (c) OSVR.
BA runs for
optimization of (a) ONN, (b) OFIS, and (c) OSVR.One of the main issues in constructing a neural
network model to
implement in a regression task is to find out the optimum number of
neurons in hidden layers. This number has a significant effect on
the accuracy and complexity of developed models. Therefore, it is
pretty crucial to precisely determine this number. Further analysis,
which is depicted in Figure , was used for finding this optimum number, and based on this
figure, it can be clearly deduced that 5 is the best value for the
number of neurons that must be included in the hidden layer.
Figure 7
Statistical
measurement for finding the best value for the number
of neurons in the hidden layer. It is evident that 5 would be the
best number since it makes the neural network have the maximum correlation
of determination as well as minimum MSE.
Statistical
measurement for finding the best value for the number
of neurons in the hidden layer. It is evident that 5 would be the
best number since it makes the neural network have the maximum correlation
of determination as well as minimum MSE.The optimum values of weights and biases of ONN
are given in Table .
Table 5
ONN Optimum Parameters Determined
via BA
layer
weights
node
input 1
input 2
input 3
biases
hidden layer
node 1
2.3022
–0.1883
–0.3005
–2.1802
node 2
–1.9735
–0.5759
–1.0198
0.4422
node 3
0.3176
–0.7212
0.0201
–0.9432
node 4
–0.4376
0.0769
0.0575
–0.1849
node 5
1.4397
2.0913
–0.0002
2.1387
After constructing the model, test data and validation
data were
employed to assess the validity of the models. In Figure b, a comparison between the
outputs of ONN and real values is presented in a cross-plot. Considering
this figure, ONN has acceptable goodness-of-fit in predicting the
RI. In Figure , the
capability of ONN is displayed in each of the sample points. According
to this figure, the ONN method is effective since observed and calculated
data are very close. Moreover, statistical measurements of ONN outputs
are listed in Table .
Figure 8
ONN-predicted RI in each sample number for (a) testing data, (b)
validation (O), and (c) validation (DO), which confirms the validity
of this method.
ONN-predicted RI in each sample number for (a) testing data, (b)
validation (O), and (c) validation (DO), which confirms the validity
of this method.
OFIS
Results
At first, training data
points were adapted to learn the model and find its parameters. As
mentioned before, improper regulation of membership functions reduces
the efficiency of fuzzy inference systems. To avoid this problem,
BA is integrated with a fuzzy inference system in order to reach the
optimal values of its membership function. BA regulation parameters
of OFIS are provided in Table . Results of running BA to disclose these suitable values
are presented in Figure b. Moreover, statistical analysis for determining the best value
of cluster radius is shown in Figure . This figure displays that 0.4 is the optimum value
of the cluster radius.
Figure 9
Statistical evaluation for extracting the cluster radius.
Based
on this figure, it can be found that 0.4 is the best value for the
cluster radius to achieve the best models.
Statistical evaluation for extracting the cluster radius.
Based
on this figure, it can be found that 0.4 is the best value for the
cluster radius to achieve the best models.After deciding the optimal value of the OFIS model,
test data (96
data points) and validation data (252 data points) were imported in
the constructed model, and the value of RI in these data points was
calculated. After the RI is predicted and compared with real values
(Figure c), it can
be concluded that OFIS is a well-performing technique that can provide
satisfactory results. Besides, in each sample point, the predicted
and real values were compared (Figure ). This figure illustrates capability of
OFIS in correlating RI with its conditioning parameters.
Figure 10
OFIS-predicted
RI in each sample number for (a) testing data, (b)
validation (O), and (c) validation (DO). For all portions of the data,
outputs of OFIS highly match with measured values.
OFIS-predicted
RI in each sample number for (a) testing data, (b)
validation (O), and (c) validation (DO). For all portions of the data,
outputs of OFIS highly match with measured values.Table presents
the statistical indexes of this model. According to this modeling
scheme, OFIS is totally proper for the driving equation that proves
the functional dependency between RI and its influencing parameters.
OSVR Results
To use OSVR in our modeling
of RI, first, training data including 384 data points were employed
to train the model. As mentioned in Section , there is a possibility that penalty parameters
in support vector regression were adjusted inappropriately. To manage
to successfully overcome this major deficiency in this study, support
vector regression was merged with BA and the accuracy of the model
was improved through extracting the optimum values of free parameters.
BA regulation parameters for OSVR are summarized in Table . The BA performance is depicted
in Figure c, and the
optimal parameters of the support vector regression model are listed
in Table .
Table 6
Optimum Parameters of OSVR Determined
by BA
parameter
value
gamma
7.72 × 10–5
C
1.8937
epsilon
0.0015
After finding the optimal values of OSVR models, test
data and
validation data sets were included in constructing the model, and
the values of RI were found. Figure d demonstrates the estimated RI values from OSVR vs
the measured values of RI. In Figure , the comparison between predicted values and the observed
values vs the sample numbers is represented. Based on these figures,
it can be concluded that predicted RI values have satisfactory agreement
with the observed values of RI.
Figure 11
OSVR-predicted RI in each sample number
for (a) testing data, (b)
validation (O), and (c) validation (DO). It can be concluded that
OSVR has a satisfactory capability in predicting the RI.
OSVR-predicted RI in each sample number
for (a) testing data, (b)
validation (O), and (c) validation (DO). It can be concluded that
OSVR has a satisfactory capability in predicting the RI.Moreover, according to the statistical results
presented in Table , it can be clearly
concluded that this method is effective and viable for modeling RI.
Sensitivity Analysis
SA enables us
to decide which input parameter would have the highest impact on the
final outcome, which is the prediction of RI. For this purpose, an
extremely simple structure was proposed based on Gandomi et al.’s[86] workflow. The following formulas are used to
calculate the dependency of output on independent variables[86]Here, fmax(x) and fmin(x) are the max and the min of the estimated
values over the ith input domain, respectively, while
mean values of other variables are replaced. Table shows the results of SA for the proposed
models. It explains that the highest influence on the RI of hemoglobin
is from concentration (C).
Table 7
Sensitivity Analysis
of Constructed
Models for Different Input Variables
input
ACE
ONN
OFIS
OSVR
concentration
66.51993
66.23167
66.34054
65.53819
wavelength
26.13098
26.62641
26.97092
26.90064
temperature
7.34910
7.14193
6.68853
7.56117
Comparative Study
Table presents the accuracy and general
performance of the constructed models (ONN, OFIS, OSVR, and ACE) at
a glance. This table evaluates the four models based on the assessed
statistical criteria in section . As it can be seen in the table, results of ACE have
smaller values of AARE, SMAPE, and MSE and higher R2 in
comparison with the optimized models, which proves its superiority
in estimation of RI. In addition, a visual comparison of performances
of designed models is depicted in Figure . According to this figure, we understand
that ACE’s capability in estimation of RI is better compared
to other AI models.
Conclusions
Obtaining
a model to accurately
determine the RI of hemoglobin
is absolutely vital because this parameter plays a prominent role
in medical diagnosis. To make quantitative formulations between the
RI of hemoglobin and its influencing parameters, a statistical technique
named ACE and three optimized models including ONN, OFIS, and OSVR
were employed in this research. Results of the predictive models were
appraised based on the statistical criteria. Moreover, SA was applied
to measure the importance of each parameter in the estimation of RI.
According to the simulation results of this study, the following conclusions
can be achieved from this paper:The efficiency of ACE in relating the
RI of hemoglobin to input parameters was enhanced successfully. Moreover,
all three optimized models’ (ONN, OFIS, and OSVR) estimations
of RI of hemoglobin were highly accurate when we compared predictions
with observed values.Capability of the BA in optimizing the
weight and bias of a neural network could highly improve the model
prediction performance. This was attained since the BA finds the optimum
value of user-defined parameters of support vector regression with
good accuracy.The embedding
of BA in the formulation
of a fuzzy inference system led to optimal computation of membership
functions with satisfactory precision to ultimately generate RI values
with the least discrepancy with measured RI values.Considering the evaluation of the models
based on statistical criteria, the accuracy of ACE was found to be
higher than those of ONN, OFIS, and OSVR. Finally, SA indicated that
the important input parameters in the prediction of RI are concentration,
wavelength, and temperature.According to promising results of ACE,
implementation of this methodology can conveniently eliminate the
prohibitive cost of experimental measurement of hemoglobin’s
RI.3
Authors: Manuel E Gegundez-Arias; Diego Marin-Santos; Isaac Perez-Borrero; Manuel J Vasallo-Vazquez Journal: Comput Methods Programs Biomed Date: 2021-04-08 Impact factor: 5.428
Authors: Tajudeen A Oyehan; Ibrahim O Alade; Aliyu Bagudu; Kazeem O Sulaiman; Sunday O Olatunji; Tawfik A Saleh Journal: Comput Biol Med Date: 2018-04-30 Impact factor: 4.589
Authors: YongKeun Park; Monica Diez-Silva; Gabriel Popescu; George Lykotrafitis; Wonshik Choi; Michael S Feld; Subra Suresh Journal: Proc Natl Acad Sci U S A Date: 2008-09-04 Impact factor: 11.205