| Literature DB >> 35626457 |
Jazmín S De la Cruz-García1, Juan Bory-Reyes2, Aldo Ramirez-Arellano1.
Abstract
Decision trees are decision support data mining tools that create, as the name suggests, a tree-like model. The classical C4.5 decision tree, based on Shannon entropy, uses a simple algorithm: it calculates the gain ratio of each attribute and splits on the attribute that maximises this entropy-based measure. Tsallis and Rényi entropies can be employed instead of Shannon's to generate decision trees with better results. In practice, the entropic index parameter of these entropies is tuned so that the resulting tree outperforms the classical one. However, this tuning is carried out by testing a range of values on a given database, which is time-consuming and unfeasible for massive data. This paper introduces a decision tree based on a two-parameter fractional Tsallis entropy. We propose a constructionist approach that represents databases as complex networks, enabling efficient computation of the parameters of this entropy via the box-covering algorithm and renormalization of the complex network. The experimental results support the conclusion that the two-parameter fractional Tsallis entropy is a more sensitive measure for a decision tree classifier than its parametric Rényi, Tsallis, and Gini-index precedents.
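The split criteria mentioned in the abstract can be made concrete. Below is a minimal Python sketch of the standard Shannon, Tsallis, and Rényi entropies and the C4.5 gain ratio; the paper's two-parameter fractional Tsallis entropy generalises these, and its exact form is not reproduced here.

```python
import math
from collections import Counter

def shannon(probs):
    """Shannon entropy H = -sum p * log2(p)."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

def tsallis(probs, q):
    """Tsallis entropy S_q = (1 - sum p^q) / (q - 1); tends to Shannon (nats) as q -> 1."""
    if abs(q - 1) < 1e-12:
        return -sum(p * math.log(p) for p in probs if p > 0)
    return (1 - sum(p ** q for p in probs)) / (q - 1)

def renyi(probs, q):
    """Renyi entropy R_q = ln(sum p^q) / (1 - q); tends to Shannon (nats) as q -> 1."""
    if abs(q - 1) < 1e-12:
        return -sum(p * math.log(p) for p in probs if p > 0)
    return math.log(sum(p ** q for p in probs)) / (1 - q)

def class_probs(labels):
    """Empirical class probabilities of a list of labels."""
    n = len(labels)
    return [c / n for c in Counter(labels).values()]

def gain_ratio(records, labels, attr):
    """C4.5 gain ratio: information gain divided by split information."""
    base = shannon(class_probs(labels))
    n = len(labels)
    groups = {}
    for rec, lab in zip(records, labels):
        groups.setdefault(rec[attr], []).append(lab)
    cond = sum(len(g) / n * shannon(class_probs(g)) for g in groups.values())
    split_info = shannon([len(g) / n for g in groups.values()])
    return 0.0 if split_info == 0 else (base - cond) / split_info
```

C4.5 splits on the attribute with the highest gain ratio; the parametric variants simply substitute Tsallis or Rényi entropy for Shannon's in the gain computation.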
Keywords: Gini index; complex networks; decision trees; two-parameter Tsallis entropy
Year: 2022 PMID: 35626457 PMCID: PMC9141694 DOI: 10.3390/e24050572
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.738
Figure 1. A decision tree for the classification task.
Figure 2. Network construction from a database. Nodes of the same colour belong to the same box for the chosen box size.
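The mapping from a table of records to a network can be illustrated with a generic co-occurrence construction: one node per attribute-value pair (including the class column), one edge per pair of values appearing together in a record. This rule is a hypothetical illustration; the paper's exact construction may differ.

```python
from itertools import combinations

def database_to_network(records):
    """Build an undirected network from a table of records.
    Nodes: (attribute, value) pairs. Edges: co-occurrence of two
    values in the same record. Illustrative assumption only."""
    nodes = set()
    edges = set()
    for rec in records:
        items = sorted(rec.items())          # (attribute, value) pairs
        nodes.update(items)
        for u, v in combinations(items, 2):  # link values co-occurring in this record
            edges.add((u, v))
    return nodes, edges

# toy table with two attributes and a class column
records = [
    {"colour": "red", "shape": "round", "class": "apple"},
    {"colour": "green", "shape": "round", "class": "apple"},
    {"colour": "yellow", "shape": "long", "class": "banana"},
]
nodes, edges = database_to_network(records)
```

With nominal attributes the node count depends on the number of distinct attribute values rather than the number of records, which matches the table below, where e.g. the 1728-record Car database yields only 25 nodes.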
Figure 3. Box covering of a network. (a) Original network. (b) Dual network. (c) Colouring process. (d) Mapping colours to the original network.
The box size, the number of boxes obtained from the network of Figure 3, and the corresponding "pseudo matrix" values.
| Box size | No. of boxes |  |  |  |  |
|---|---|---|---|---|---|
| 1 | 6 | - | - | - | - |
| 2 | 3 | 0.107 |  |  |  |
| 3 | 2 | 0.107 |  |  | - |
| 4 | 2 | 0.143 |  |  | - |
| 5 | 1 | - | - | - | - |
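The box covering of Figure 3 follows the classical dual-network colouring scheme: nodes at distance greater than or equal to the box size are linked in a dual network, and a greedy colouring of the dual assigns each node to a box. A minimal sketch (greedy colouring is order-dependent, so the box count is an upper bound on the optimum):

```python
from collections import deque
from itertools import combinations

def bfs_distances(adj, source):
    """Hop distances from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist

def box_covering(adj, lb):
    """Greedy box covering: link nodes at distance >= lb in a dual
    network, then greedy-colour the dual; each colour is one box.
    Returns a dict node -> box id."""
    nodes = list(adj)
    dist = {u: bfs_distances(adj, u) for u in nodes}
    dual = {u: set() for u in nodes}
    for u, v in combinations(nodes, 2):
        if dist[u].get(v, float("inf")) >= lb:
            dual[u].add(v)
            dual[v].add(u)
    colour = {}
    for u in nodes:                       # greedy colouring of the dual network
        used = {colour[v] for v in dual[u] if v in colour}
        c = 0
        while c in used:
            c += 1
        colour[u] = c
    return colour

# toy path graph 0-1-2-3: for box size 2 it splits into two boxes {0,1} and {2,3}
adj = {0: {1}, 1: {0, 2}, 2: {1, 3}, 3: {2}}
boxes = box_covering(adj, lb=2)
```

Counting the distinct box ids for each box size yields exactly the (box size, number of boxes) columns of the table above.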
Figure 4. Renormalization of a network. (a) Grouping nodes into boxes. (b) Converting boxes into supernodes.
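Renormalization (Figure 4) collapses each box into a supernode and links two supernodes whenever any pair of their member nodes is linked in the original network. A minimal sketch using an adjacency-dict representation, with a toy box assignment chosen for illustration:

```python
def renormalize(adj, boxes):
    """Renormalize a network: one supernode per box; two supernodes
    are linked if any members of the two boxes are linked."""
    super_adj = {b: set() for b in set(boxes.values())}
    for u, neighbours in adj.items():
        for v in neighbours:
            bu, bv = boxes[u], boxes[v]
            if bu != bv:                  # ignore links inside a box
                super_adj[bu].add(bv)
                super_adj[bv].add(bu)
    return super_adj

adj = {0: {1}, 1: {0, 2}, 2: {1, 3}, 3: {2}}   # path of 4 nodes
boxes = {0: 0, 1: 0, 2: 1, 3: 1}               # toy box assignment (box size 2)
super_adj = renormalize(adj, boxes)            # collapses to a path of 2 supernodes
```

Iterating box covering and renormalization at growing box sizes is the standard way to estimate a network's fractal scaling, which is how the entropy parameters are obtained here.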
Database and network features. N = nominal, U = numerical, M = mixed.
| Database | Records | Attributes | Type | Classes | Balanced | Nodes | Edges |
|---|---|---|---|---|---|---|---|
| Breast Cancer | 699 | 9 | N | 2 | No | 737 | 1276 |
| Car | 1728 | 6 | N | 4 | Yes | 25 | 70 |
| Cmc | 1473 | 9 | M | 3 | No | 74 | 264 |
| Glass | 214 | 10 | U | 7 | No | 1159 | 1743 |
| Haberman | 306 | 3 | U | 2 | No | 94 | 395 |
| Hayes | 160 | 5 | N | 3 | No | 150 | 186 |
| Image | 2310 | 19 | U | 7 | Yes | 12,705 | 24,411 |
| Letter | 20,000 | 16 | U | 16 | Yes | 282 | 2700 |
| Scale | 625 | 4 | N | 3 | No | 23 | 90 |
| Vehicle | 946 | 18 | U | 4 | Yes | 1434 | 8064 |
| Wine | 178 | 13 | U | 3 | No | 1279 | 2239 |
| Yeast | 1484 | 9 | M | 10 | No | 1917 | 4907 |
Figure 5. The networks from the (a) non-discretized and (b) discretized vehicle database.
The parameters of the fractional Tsallis decision tree, obtained using the networks from the discretized databases.
| Database |  |  |  |  |  |  |
|---|---|---|---|---|---|---|
| Breast Cancer | 0.173 | 0.189 | 1.147 | 1.134 | 0.853 | 0.866 |
| Car | 0.303 | 0.347 | 1.137 | 1.120 | 0.863 | 0.880 |
| Cmc | 0.169 | 0.185 | 1.152 | 1.138 | 0.848 | 0.862 |
| Glass | 0.171 | 0.187 | 1.154 | 1.141 | 0.846 | 0.859 |
| Haberman | 0.344 | 0.420 | 1.333 | 1.273 | 0.667 | 0.727 |
| Hayes | 0.269 | 0.310 | 1.231 | 1.200 | 0.769 | 0.800 |
| Image | 0.117 | 0.123 | 1.056 | 1.054 | 0.944 | 0.946 |
| Letter | 0.155 | 0.165 | 1.05 | 1.047 | 0.950 | 0.953 |
| Scale | 0.352 | 0.421 | 1.217 | 1.182 | 0.783 | 0.818 |
| Vehicle | 0.092 | 0.096 | 1.106 | 1.101 | 0.894 | 0.899 |
| Wine | 0.119 | 0.127 | 1.147 | 1.138 | 0.853 | 0.862 |
| Yeast | 4.574 | 5.081 | 1.003 | 1.003 | 0.997 | 0.997 |
The AUROC and MCC of the classical (CT) and two-parameter fractional Tsallis (TFTT) decision trees, together with the TFTT parameters. A "+" means that the AUROC or MCC is statistically greater than that of CT.
| Database |  |  |  |  |  |  |  | Param. Set |
|---|---|---|---|---|---|---|---|---|
| Breast Cancer | 0.959 | 0.889 | 0.173 | 0.853 | 0.866 |  |  |  |
| Car | 0.981 | 0.982 | 0.892 | 0.347 | 0.863 | 0.880 |  |  |
| Cmc | 0.691 | 0.315 | 0.169 | 1.152 | 1.138 |  |  |  |
| Glass | 0.794 | 0.56 | 0.171 | 1.154 | 1.141 |  |  |  |
| Haberman | 0.579 | 0.18 | 0.344 | 1.333 | 1.273 |  |  |  |
| Hayes | 0.869 | 0.578 | 0.269 | 1.231 | 1.200 |  |  |  |
| Image | 0.994 | 0.992 | 0.982 | 0.123 | 1.056 | 1.054 |  |  |
| Letter | 0.969 | 0.912 | 0.155 | 0.950 | 0.953 |  |  |  |
| Scale | 0.845 | 0.678 | 0.421 | 1.217 | 1.182 |  |  |  |
| Vehicle | 0.762 | 0.755 | 0.395 | 0.092 | 0.894 | 0.899 |  |  |
| Wine | 0.968 | 0.933 | 0.119 | 1.147 | 1.138 |  |  |  |
| Yeast | 0.743 | 0.733 | 0.462 | 4.574 | 0.997 | 0.997 |  |  |
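The two evaluation metrics reported in these tables can be computed directly. A self-contained sketch of the binary-case MCC and a rank-based AUROC; the paper's multi-class databases require the generalised forms (e.g. one-vs-rest averaging), which are not shown here.

```python
import math

def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient from a binary confusion matrix."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return 0.0 if denom == 0 else (tp * tn - fp * fn) / denom

def auroc(scores, labels):
    """AUROC via the Mann-Whitney formulation: the probability that a
    random positive is scored above a random negative (ties count 1/2).
    Assumes at least one positive and one negative label."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))
```

MCC ranges over [-1, 1] (1 = perfect, 0 = chance), while AUROC ranges over [0, 1] with 0.5 = chance, which is why the two columns in the tables above differ in scale.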
AUROC and MCC of the classical (CT), Rényi (RT), and Tsallis (TT) decision trees. A "+" means that the AUROC or MCC is statistically greater than that of CT, and a "−" means it is statistically lower.
| Database |  |  |  |  |  |  |  |  |
|---|---|---|---|---|---|---|---|---|
| Breast Cancer | 0.959 | 0.963 | 0.889 | 0.887 |  |  |  |  |
| Car | 0.981 | 0.983 | 0.982 | 0.892 |  |  |  |  |
| Cmc | 0.691 | 0.315 |  |  |  |  |  |  |
| Glass | 0.794 | 0.56 |  |  |  |  |  |  |
| Haberman | 0.579 | 0.18 | 0.152 |  |  |  |  |  |
| Hayes | 0.869 | 0.869 | 0.578 | 0.587 |  |  |  |  |
| Image | 0.994 | 0.997 | 0.995 | 0.982 | 0.984 | 0.978 |  |  |
| Letter | 0.969 | 0.967 | 0.912 | 0.913 |  |  |  |  |
| Scale | 0.845 | 0.839 | 0.857 | 0.678 |  |  |  |  |
| Vehicle | 0.762 | 0.776 | 0.748 | 0.395 | 0.371 |  |  |  |
| Wine | 0.968 | 0.963 | 0.933 | 0.924 |  |  |  |  |
| Yeast | 0.743 | 0.462 |  |  |  |  |  |  |
AUROC and MCC of the Gini (GT) and two-parameter fractional Tsallis (TFTT) decision trees. A "+" means that the AUROC or MCC is statistically greater than that of GT.
| Database | AUROC (GT) | AUROC (TFTT) | MCC (GT) | MCC (TFTT) |
|---|---|---|---|---|
| Breast Cancer | 0.963 |  | 0.888 |  |
| Car | 0.981 |  | 0.897 |  |
| Cmc | 0.58 |  | 0.357 |  |
| Glass | 0.712 |  | 0.437 |  |
| Haberman | 0.52 |  | 0.068 |  |
| Hayes | 0.871 |  | 0.655 |  |
| Image | 0.988 |  | 0.946 |  |
| Letter | 0.962 |  | 0.894 |  |
| Scale | 0.866 |  | 0.654 |  |
| Vehicle | 0.71 |  | 0.294 |  |
| Wine | 0.932 |  | 0.847 |  |
| Yeast | 0.728 |  | 0.414 |  |