Literature DB >> 27298619

A General Fuzzy Cerebellar Model Neural Network Multidimensional Classifier Using Intuitionistic Fuzzy Sets for Medical Identification.

Jing Zhao1, Lo-Yi Lin2, Chih-Min Lin3.   

Abstract

The diversity of medical factors makes the analysis and judgment of uncertainty one of the challenges of medical diagnosis. A well-designed classification and judgment system for medical uncertainty can increase the rate of correct medical diagnosis. In this paper, a new multidimensional classifier is proposed by using an intelligent algorithm, which is the general fuzzy cerebellar model neural network (GFCMNN). To obtain more information about uncertainty, an intuitionistic fuzzy linguistic term is employed to describe medical features. The solution of classification is obtained by a similarity measurement. The advantages of the novel classifier proposed here are drawn out by comparing the same medical example under the methods of intuitionistic fuzzy sets (IFSs) and intuitionistic fuzzy cross-entropy (IFCE) with different score functions. Cross verification experiments are also taken to further test the classification ability of the GFCMNN multidimensional classifier. All of these experimental results show the effectiveness of the proposed GFCMNN multidimensional classifier and point out that it can assist in supporting for correct medical diagnoses associated with multiple categories.

Entities:  

Mesh:

Year:  2016        PMID: 27298619      PMCID: PMC4889801          DOI: 10.1155/2016/8073279

Source DB:  PubMed          Journal:  Comput Intell Neurosci


1. Introduction

In most of the medical diagnosis problems, there exist some base patterns, and the medical decisions can be made on the basis of the similarity between the unknown samples and the base patterns [1-3]. Uncertainty is an inherent characteristic of medical problems [4, 5], so fuzzy approach could be appropriate to deal with these problems [6]. In order to describe uncertainty more accurately, a suitable fuzzy set is necessary. Fuzzy sets (FSs), proposed by Zadeh [7], are frameworks to employ when encountering some vagueness. Based on the concepts of fuzzy set theory, numerous fuzzy approaches to medical diagnosis have been applied [8-12]. Another fuzzy theory, intuitionistic fuzzy sets (IFSs), has been widely used in several investigations of medical diagnosis [8, 13–16]. IFSs were proposed by the Bulgarian scholar Atanassov in 1986 [17]. On the basis of conventional fuzzy sets, a new attribute parameter, the nonmembership function, is added, which can describe the characterization of fuzzy nature more precisely. Due to the abundance of fuzzy linguistic terms in comparison to conventional fuzzy logic, interest in adopting IFSs with artificial neural networks has emerged [18-25]. The IFSs approach can combine the capability of intuitionistic fuzzy reasoning in handling uncertain information and the benefits of artificial learning in modeling the systems. This paper proposed a novel multidimensional classifier based on an intelligent algorithm in IFSs. This method is referred to as a fuzzy cerebellar model neural network (FCMNN). A cerebellar model neural network (CMNN) can be thought of as a learning mechanism imitating the cerebellum of a human being and possesses a non-fully connected perceptron-like associative memory network with overlapping receptive fields [26]. It has already been shown to be able to approximate a nonlinear function over a domain of interest to any desired accuracy. Combined with fuzzy theory, FCMNN not only offers a unique and flexible framework for knowledge representation but also processes the quick learning ability of CMAC. The advantages of using FCMNN in many applications have been well documented [27-29], such as good generalization and rapid learning speed and convergence. Moreover, FCMNN can be viewed as the generation of a fuzzy neural network. If each layer of the FCMNN is reduced to contain only one different neuron, then it can be reduced to a fuzzy NN [30-33], such that it also can be called the general fuzzy cerebellar model neural network (GFCMNN). Therefore, a GFCMNN multidimensional classifier is designed for medical classification problems in IFSs with a similarity measure. Finally, some simulations and comparisons are performed to illustrate the effectiveness of the proposed design method. This paper is organized as follows. The GFCMNN multidimensional classifier is introduced in Section 2. Section 3 describes the medical data features in IFSs. In Section 4, experimental results are provided to illustrate the effectiveness of the proposed classifier. Finally, conclusions are drawn in Section 5.

2. General Fuzzy Cerebellar Model Neural Network Multidimensional Classifier

In most cases, a cerebellar model neural network is applied without fuzzy rules. To enable better use of experience knowledge, an extended general fuzzy cerebellar model neural network is designed for the multidimensional classifier.

2.1. Structure of the GFCMNN Multidimensional Classifier

A GFCMNN with the following fuzzy inference rules is proposed:where R is the λth rule, n is the input dimension, n is the number of the layers for each input dimension, n is the number of blocks for each layer, n is the output dimension, n = n n is the number of the fuzzy rules, f is the fuzzy set for the ith input, jth layer, and kth block, and w is the weight for the oth output in the consequent part. The architecture of this GFCMNN is shown in Figure 1. It is different from the fuzzy neural network (FNN) because the processing structure includes layers and blocks in the association memory space.
Figure 1

Architecture of a GFCMNN.

In this GFCMNN, if each layer is reduced to contain only one different neuron, then this GFCMNN can be reduced to an FNN. Thus, this GFCMNN can be viewed as a generalization of an FNN, and it offers better generalization, faster leaning, and quicker recall than the FNN. The GFCMNN is composed of two consequent mappings and an output computation with the spaces of the input space I, association memory space A, receptive-field space R, weight memory space W, and output space O. These functional mappings are Mapping I : I → A, Mapping A : A → R, and Mapping R : R → W, where I and O are n -dimension and n -dimension, respectively. The signal propagation and the basic function in each space are described as follows.

2.1.1. Input: Input Space I

For a given I = [I 1,…, I ,…, I ] ∈ ℜ , each input state variable I is assumed to be quantized into n discrete regions (called “elements” or “neurons”). The number of elements n is termed as a resolution.

2.1.2. Membership Function: Association Memory Space A

In this space, several elements can be accumulated as a block. The number of blocks n , in the GFCMNN, is usually greater than two. By shifting each element, different blocks will be obtained. In this space, each block performs a receptive-field membership function. Here, the Gaussian function is adopted as the receptive-field membership function represented aswhere m is the mean and v is the variance of the jth layer and kth block membership function corresponding to the ith input variable.

2.1.3. Fuzzy Inference: Receptive-Field Space R

The product fuzzy inference is used as the “and” computation in the antecedent part. Thus, hypercubes, called receptive fields, are formed by multiple-input regions. The number of hypercubes is equal to n = n n . The content of a hypercube can be expressed as r , which is the general basis function associated with the jth layer and kth block, that isThe multidimensional receptive-field functions can be expressed in a vector form as

2.1.4. Fuzzy Output: Weight Memory Space W

Each location of the receptive field in relation to a particular adjustable value in the weight memory space can be expressed aswhere w denotes the connecting weight value of the oth category output associated with the jth layer and kth block.

2.1.5. Category Output: Output Space O

The GFCMNN output is the algebraic sum of the hypercube contents with activated weights. The oth output of the GFCMNN multidimensional classifier is represented as

2.2. Normalized Gradient Descent Algorithm

Because of the characteristic of fast convergence, the normalized iterative gradient decent algorithm is applied to adjust the parameters, and back propagation (BP) has been designed to deduce the learning rule of this GFCMNN multidimensional classifier.

2.2.1. Cost Function

To describe the online learning algorithm of GFCMNN, the cost function E is defined aswhere e (k) = d (k) − O (k) denotes the error of oth category output, d (k) is the oth target output, and O (k) is the oth category output of GFCMNN.

2.2.2. Update Learning Laws

With the cost function E(k), the parameter updating learning law of GFCMNN based on the normalized gradient descent algorithm can be derived according towhere z is replaced by w, m, and v, denoting the updating law for output weight, mean, and variance, respectively. Moreover, the gradient operation factor P (k) = ∂O /∂z in (8) is defined as Then, the parameter adjustment rules of weight, mean, and variance can be described as where η , η , and η are the learning rates of output weight, mean, and variance, respectively.

2.3. Convergence Analyses

The learning laws in (12), (13), and (14) call for a proper choice of the learning rates η , η , and η . For a small value of learning rates, the convergence is easy to guarantee; however, the learning speed is slow. On the other hand, if learning rates are too large, the learning mechanism may become more unstable. In order to train the GFCMNN effectively, the variable learning rates, which guarantee the convergence of the output error, are derived as follows. Moreover, the optimal learning rates which guarantee the fastest convergence of the output error are also derived.

Theorem 1 .

Let η be the learning rate of the GFCMNN, and let P (k) be given in (9), (10), and (11) for z = w, m, or v, respectively. Then, the convergence of the tracking error is guaranteed if η is chosen asMoreover, the variable optimal learning rate equals

Proof

Define a Lyapunov function as Then, the change of the Lyapunov function is obtained asand the error difference can be represented by Using the chain rule, the following is obtained: Thus, Substituting (21) into (18), ΔV(k) can be represented asIf η is chosen as (15), ΔV(k) in (22) is less than 0. Therefore, the Lyapunov stability of V > 0 and ΔV < 0 is guaranteed. Thus, the convergence of tracking error e (k) is guaranteed. Moreover, for the sake of achieving the fastest convergence, the optimal learning rates correspond to η = 1/‖P (k)‖2, which comes from the derivative of (22) with respect to η and equals zero. This shows an interesting result, that is, variable optimal learning rates which can be adjusted online at each instant to achieve the fastest convergence of the tracking error with guaranteed stability. In conclusion, the GFCMNN multidimensional classifier is defined by (6). The parameter learning rule is deduced by the normalized gradient descent algorithm and the weight, mean, and variance can be adjusted according to (12), (13), and (14), respectively. The optimal learning rates are designed to guarantee the convergence of this GFCMNN multidimensional classifier.

3. Medical Data Features in Intuitionistic Fuzzy Sets

3.1. Intuitionistic Fuzzy Sets for Medical Data

To make a proper medical classification, a medical knowledge base is necessary. In this paper, an IFS for a medical knowledge base is considered. Fuzzy sets theory, proposed by Zadeh [7] in 1965, has been successfully applied in various fields. In this theory, the membership of an element to a fuzzy set is a single value between zero and one, and the linguistic variables are given in terms of a membership function only. However, in some situations, such as medical diagnosis, sales analysis, and financial services, which are described by a conventional fuzzy set, this theory seems too rough. Due to the possibility of a nonnull hesitation part for any unknown object at each moment when estimating, the degree of nonmembership of an element to a fuzzy set is just equal to 1 minus the degree of membership; that is, there may be some degree of hesitation. Thus, as a generalization of fuzzy sets, the concept of IFSs was introduced by Atanassov in 1986 [15]. In an intuitionistic fuzzy set, A in the universe of discourse X can be defined as a set of ordered pairs:where μ : X → [0,1] and v : X → [0,1] indicate the degree of x which belongs to A and does not belong to A, respectively. μ (·) is called the membership function, and v (·) is called the nonmembership function. For each IFS A in X, the “hesitation margin” (or “intuitionistic fuzzy index”) of x ∈ X is given bywhere 0 ≤ μ (x) ≤ 1, 0 ≤ v (x) ≤ 1, and 0 ≤ π (x) ≤ 1, ∀x ∈ X, which expresses the hesitation degree of whether x belongs to A or not. An illustration of these degrees is exhibited in Figure 2. Consequently, IFSs are an extension of the conventional FSs. To describe an IFS completely, at any rate, two functions are needed, one being the membership function and the other being the nonmembership function. In the aspects of semantic expression and reasoning ability, this is clearly better than conventional FSs.
Figure 2

Descriptions of the IFSs.

In the case of medical diagnosis, we consider the same IFSs as in the disease classification in [34, 35]. There are five diseases in Table 1 to build the set of diseases D = [viral fever, malaria, typhoid, stomach problem, chest pain]. Each disease has five symptoms to form the set of features F = [temperature, headache, stomach pain, cough, chest pain]. Each element in the table is given in the form of a group of numbers corresponding to the membership, nonmembership, and hesitation values, respectively. For example, the temperature for viral fever is described by (μ, v, π) = (0.4,0.0,0.6) in Table 1.
Table 1

Feature values for the studied diseases categories.

DiseaseFeature
TemperatureHeadacheStomach painCoughChest pain
Viral fever(0.4, 0.0, 0.6)(0.3, 0.5, 0.2)(0.1, 0.7, 0.2)(0.4, 0.3, 0.3)(0.1, 0.7, 0.2)
Malaria(0.7, 0.0, 0.3)(0.2, 0.6, 0.2)(0.0, 0.9, 0.1)(0.7, 0.0, 0.3)(0.1, 0.8, 0.1)
Typhoid(0.3, 0.3, 0.4)(0.6, 0.1, 0.3)(0.2, 0.7, 0.1)(0.2, 0.6, 0.2)(0.1, 0.9, 0.0)
Stomach problem(0.1, 0.7, 0.2)(0.2, 0.4, 0.4)(0.8, 0.0, 0.2)(0.2, 0.7, 0.1)(0.5, 0.7, 0.1)
Chest problem(0.1, 0.8, 0.1)(0.0, 0.8, 0.0)(0.2, 0.8, 0.0)(0.2, 0.8, 0.0)(0.8, 0.1, 0.1)

3.2. Score Function of IFSs

In the traditional FSs, the fuzzy relationship is obtained by the max-min-max composition. In order to fully use the provided information of IFSs, the intuitionistic fuzzy relation can be described by use of the score function, which is usually used to judge the matching degree between the intuitionistic fuzzy relation and the decision requirements. The score function, such as R = μ − v, which is proposed by Chen and Tan [36], has two elements of IFSs, but another element of hesitation degree π cannot be taken into account. A modification of the score function has been proposed (score function 1) [37]: Despite considering the factor of π, the proportion of the membership function μ has actually been weakened because of adopting the subtracted form. To overcome this shortcoming, another added form is defined as (score function 2) [38] However, in fact, a reasonable description of IFSs would be as follows: the higher the proportion of the membership function, the lesser the proportion of the nonmembership function, that is, the maximum difference between μ and v with the minimum hesitation degree π. Therefore, to describe the IFSs relation much more accurately and carefully, the applied score function is defined as (score function 3)where α and β are the constants. The greater the value of R is, the more accordant to the reality the event will be, which is described by (27) with the diverse values of α and β. By using this kind of score function, the different decision-making degrees of the three members in the IFSs are embodied while handling uncertain information, such that not only have the shortcomings of the above score functions been made up, but also the ability of the fuzzy linguistic expression has become more precise, and the description in modeling a system is more complete.

4. Experiment and Discussion

4.1. Similarity Measure for Multidimensional Classifier

The solutions of the classification are usually the distance measure and the similarity measure. Distance is defined as a quantitative degree of how far apart two objects are, while similarity is defined as the degree of similarity between two sets. In this paper, the similarity measure is used as a performance index to discuss the classification results. The similarity measure of the algorithm is defined as follows:where A is the target data set, B is the testing data set, ∩ and ∪ denote the intersection and union of A and B, respectively, and |·| indicates the cardinality of a set. This relative cardinality evaluates the proportion of elements of A ∪ B having the property A∩B, when A ∪ B is finite.

4.2. Medical Diagnosis Example

The medical diagnosis classification trained data are shown in Table 1. The tested data are shown in Table 2, which consists of a set of patients S = [Al, Bob, Joe, Ted]. The symptoms for each patient are also given in Table 2.
Table 2

Data set for the studied diseases categories.

SampleFeature
TemperatureHeadacheStomach painCoughChest pain
Al(0.8, 0.1, 0.1)(0.6, 0.1, 0.3)(0.2, 0.8, 0.0)(0.6, 0.1, 0.3)(0.1, 0.6, 0.3)
Bob(0.0, 0.8, 0.2)(0.4, 0.4, 0.2)(0.6, 0.1, 0.3)(0.1, 0.7, 0.2)(0.1, 0.8, 0.1)
Joe(0.8, 0.1, 0.1)(0.8, 0.1, 0.1)(0.0, 0.6, 0.4)(0.2, 0.7, 0.1)(0.0, 0.5, 0.5)
Ted(0.6, 0.1, 0.3)(0.5, 0.4, 0.1)(0.3, 0.4, 0.3)(0.7, 0.2, 0.1)(0.3, 0.4, 0.3)
Five features of the diseases are taken as the input of GFCMNN; that is, the input consists of 5 dimensions. And the given data shown in Table 1 are used to train the proposed GFCMNN multidimensional classifier offline. It should be classified into 5 categories according to the five diseases. In other words, the output dimension of GFCMNN is 5. Then, the considered data shown in Table 2 are used to test the correctness of this classifier. In this case, the GFCMNN is characterized as n = 5 for each input dimension to cover the input range with enough resolution; then, four layers are used and every layer contains five neurons. In the initial parameters setting, the learning rates of the GFCMNN classifier are 0.1. All of the parameters are determined by trial-and-error, in order to guarantee the desired classification performance. The initial weights, means, and variances, w , m , and v , are generated randomly. The sample types of the medical IFSs are set as membership function μ, (25), (26), and (27), where α = 100 and β = 10. The tested performances of GFCMNN are shown in Tables 3, 4, 5, and 6.
Table 3

Test performances of GFCMNN with membership function.

Sample Disease
Viral feverMalariaTyphoidStomach problemChest problem
Al0.49500.69340.47610.19160.1480
Bob0.35550.18970.64740.38900.2129
Joe0.53840.44980.51410.13680.0769
Ted0.45940.60070.39080.39510.2139
Table 4

Test performances of GFCMNN with score function 1.

Sample Disease
Viral feverMalariaTyphoidStomach problemChest problem
Al0.26670.56750.10700.12990.0143
Bob0.22740.09710.78850.10090.0446
Joe0.12770.05840.23420.05060.0388
Ted0.23460.40320.17020.15620.0212
Table 5

Test performances of GFCMNN with score function 2.

Sample Disease
Viral feverMalariaTyphoidStomach problemChest problem
Al0.70450.80520.65390.41500.2902
Bob0.56000.36490.64410.68470.3993
Joe0.68390.59950.83150.44050.3013
Ted0.64300.64620.56310.63050.2810
Table 6

Test performances of GFCMNN with score function 3.

Sample Disease
Viral feverMalariaTyphoidStomach problemChest problem
Al0.55810.67760.30110.04520.1422
Bob0.46790.19400.64610.68620.3090
Joe0.77980.52530.81120.42880.2623
Ted0.42740.60320.15960.09390.2586
According to the tested performances from Tables 3 –6, if only the membership function μ is used, the classification results are as follows: Al suffers from malaria, Bob from typhoid, Joe from typhoid, and Ted from malaria. If the score function (25) is used as the type of input, the classification results are as follows: Al suffers from malaria, Bob from typhoid, Joe from typhoid, and Ted from viral fever. When the score function (26) is used as the type of input, the classification results are as follows: Al suffers from malaria, Bob from stomach problem, Joe from typhoid, and Ted from viral fever. Under the score function (27), with α and β values identical to those above used as the pattern of input, the same classified results are obtained. Meanwhile the doctor's diagnosis results are as follows: Al suffers from malaria, Bob from stomach problem, Joe from typhoid, and Ted from malaria. Compared to the same example, a type of score function, like (25), is used in [27], and the max-min-max rule is applied to classify the five categories. Another kind of score function like μ + v was adopted, in [39], which used the method of intuitionistic fuzzy cross-entropy (IFCE) to adjust the same medical diagnosis. After mapping these five diseases to class 1–class 5, the results of comparison are tabulated in Table 7. The doctor's diagnosis is also given for comparison.
Table 7

Comparing the test performances of GFCMNN with score function 3.

SampleGFCMNN IFS μ   IFCE μ + v Doctor
μ μ μ + μv + 1/(απ + β)
Al2222212
Bob3344344
Joe1333213
Ted2222212
Table 7 shows that these disease samples can be classified by using the GFCMNN multidimensional classifier, and the accuracy is much better than those of the IFSs classifier and the IFCE classifier, even with the use of an input pattern, like μ. With the same input pattern, the classification precision of the GFCMNN classifier is also superior to that of the IFSs classifier. After adopting the GFCMNN input patterns, such as (26) and (27), the classification results are consistent with the doctor's diagnosis, although the input pattern, for instance (26), is not the best input pattern for intuitionistic fuzzy data.

4.3. Cross Validation

To further test the classification ability of the GFCMNN multidimensional classifier, the cross verification method is adopted, after combining the 5 trained samples with the 4 tested samples, which were correctly classified previously, and renumbering them as shown in Table 8.
Table 8

Renumbering of the medical intuitionistic fuzzy sample.

Sample numberOriginal infoCategory
1Viral fever1
2Malaria2
3Typhoid3
4Stomach problem4
5Chest problem5
6Ted1
7Al2
8Joe3
9Bob4
These renumbered samples are divided into 2 groups: the trained set with 5 samples and the tested set with 4 samples. In the cross validation phase, one sample of the same type is exchanged for each round by using the GFCMNN input pattern, such as membership function μ, (26) and (27). The initial parameters of GFCMNN and the score function 3 are set the same as previously mentioned. The test results are shown in Tables 9 –11.
Table 9

Cross validation results of GFCMNN with membership function 1.

SampleTest 1Test 2Test 3Test 4Test 5Expected
s1222222
s2222222
s3133333
s4343344
Table 10

Cross validation results of GFCMNN with score function 2.

SampleTest 1Test 2Test 3Test 4Test 5Expected
s1222222
s2222222
s3333333
s4443434
Table 11

Cross validation results of GFCMNN with score function 3.

SampleTest 1Test 2Test 3Test 4Test 5Expected
s1222222
s2222222
s3333333
s4444344
In this case, from Tables 9 –11, the true results are T = 16, T = 18, and T = 19, while the wrong results are N = 4, N = 2, and N = 1. Thus, the accuracy is equal to 80%, 90%, and 95%, as calculated by The experimental results show that multiple type medical data can be identified by the proposed GFCMNN classifier. Moreover, combined with fuzzy intuitionistic data, much better classification precision has been achieved.

5. Conclusion

The proposed GFCMNN multidimensional classifier is an extended structure composed of a fuzzy system and CMNN, with the respective advantages of each. The classification efficiency can be improved because of the better generalization ability, learning ability, and approximate ability of the proposed approach. When combined with the IFSs, the original features can be better presented and the classification accuracy is also enhanced. Meanwhile, the experimental results have demonstrated the effectiveness of the proposed classifier. Therefore, the classification results of the GFCMNM multidimensional classifier can assist doctors by supporting the medical diagnosis.
  5 in total

1.  Adaptive CMAC-based supervisory control for uncertain nonlinear systems.

Authors:  Chih-Min Lin; Ya-Fu Peng
Journal:  IEEE Trans Syst Man Cybern B Cybern       Date:  2004-04

2.  Adaptive control for mimo uncertain nonlinear systems using recurrent wavelet neural network.

Authors:  Chih-Min Lin; Ang-Bung Ting; Chun-Fei Hsu; Chao-Ming Chung
Journal:  Int J Neural Syst       Date:  2012-02       Impact factor: 5.866

3.  Adaptive filter design using recurrent cerebellar model articulation controller.

Authors:  Chih-Min Lin; Li-Yang Chen; Daniel S Yeung
Journal:  IEEE Trans Neural Netw       Date:  2010-07

4.  Evolving fuzzy medical diagnosis of Pima Indians diabetes and of dermatological diseases.

Authors:  Stavros Lekkas; Ludmil Mikhailov
Journal:  Artif Intell Med       Date:  2010-06-20       Impact factor: 5.326

5.  An Efficient Interval Type-2 Fuzzy CMAC for Chaos Time-Series Prediction and Synchronization.

Authors:  Ching-Hung Lee; Feng-Yu Chang; Chih-Min Lin
Journal:  IEEE Trans Cybern       Date:  2013-04-16       Impact factor: 11.448

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.