Literature DB >> 36217509

Rough set approximations based on a matroidal structure over three sets.

Gang Wang¹, Hua Mao², Chang Liu², Zhiming Zhang², Lanzhen Yang³.

Abstract

Pawlak's classical model of rough set approximations provides an efficient tool for extracting information exactly by employing available knowledge (i.e., known knowledge) in an information system, since many problems in rough set theory are NP-hard and their solution process is therefore greedy and approximate. Many extensions of Pawlak's classical model have been proposed in recent years. Most of them are considered over one or two sets, that is, one- or two-dimensional space or one- or two-dimensional data. Aided by relation-based rough set models, a few of these extensions are considered over three sets. However, the real world is in three-dimensional space. Therefore, it is necessary to solve these problems with other models, such as covering rough set models. For this purpose, we propose the TP-matroid-a matroidal structure over three sets. Employing the family of feasible sets of a TP-matroid as the available knowledge, a pair of rough set approximations-lower and upper approximations-is provided. In addition, for an information system defined over three sets, assisted by formal concept analysis, we establish a pair of rough set approximations. Furthermore, two TP-matroids are established based on the above pair of rough set approximations. The integration between the two pairs of rough set approximations presented here is discussed. The results show that for an information system in three-dimensional space, the rough set approximations provided here can effectively explore unknown knowledge by using available knowledge based on the family of feasible sets of a TP-matroid.

© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022, Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Entities: Chemical

Keywords: Covering; Rough set approximations; Semiconcept; TP-matroid; Three sets

Year: 2022 PMID： 36217509 PMCID： PMC9534742 DOI： 10.1007/s10489-022-04144-5

Source DB: PubMed Journal: Appl Intell (Dordr) ISSN： 0924-669X Impact factor: 5.019

Introduction

Rough set theory, proposed by Pawlak [1, 2], addresses the vagueness and uncertainty of data tables. Its basic operators are known as lower and upper approximations. Pawlak’s classical rough set approximations are defined by a partition of a universe (i.e., a nonempty set) [1, 2], which restricts the applications of rough sets in real cases. Many researchers have generalized Pawlak’s classical rough set model based on more general binary relations [3-8], by employing coverings [4, 8–12], or by combining the model with other theories such as matroid theory [13-17] and others [18-33]. Moreover, Pawlak’s classical model is also restricted by the number of universes, which is one. Hence, another interesting type of generalization of Pawlak’s classical rough set model is to extend the single universe to more than one universe, which has become a very popular topic in recent years and has yielded fruitful results [34-40]. Among them, it is worth mentioning that based on relations, Sun and Ma [36] generalized Pawlak’s classical rough set model from one universe to not only two but three universes and considered further multi-universe cases for fuzzy rough sets, even infinite universes. For relation-based fuzzy rough sets, the model in [36] is perfect. However, now, with respect to covering-based rough sets over multiple universes, there are few articles with results as good as those of [36], although some achievements have been made for two different universes [41, 42]. There are differences and connections between the two rough set models—relation-based and covering-based [8]. Therefore, it is necessary to consider generalizing Pawlak’s classical rough set model from one to three sets from the perspective of the covering rough set model. The achievements of rough sets in application fields are low-hanging fruit in many domains [23, 36, 43–55]. They show that the demands of practical use in many real-life fields are one of the driving forces promoting the development of rough set theory. Matroid theory, proposed by Whitney [56], is used to generalize graph theory and linear algebra [57, 58]. Since its inception, many matroidal structures have been produced by combination with other theories, such as rough sets [13–17, 59, 60]. Matroid theory can be employed to solve combinatorial optimization problems due to its good structure for greedy algorithms [57, 58]. In real life, some information appears with matroid constraints, so problems that involve such information need to be solved with the assistance of matroid theory [61-65]. In what follows, the necessity of studying a covering rough set model over a matroidal structure in reality is illustrated through an example of the biological classification of insects on the basis of morphology. According to the common methods of biological classification of insects, we can see that (1) in research on the classification of insects from morphology, the researcher first collects the insect specimens of some group. Next, for a family of specimens from different locations, or even specimens from the same location, combined with the morphological characteristics that the researcher believes need to be considered, the properties of the specimens in terms of these morphological characteristics are taken as the research content; the researcher will use his or her existing insect morphological knowledge that is closest to the discussed content to approximate the discussed content to obtain the results that the researcher believes are most appropriate. The collected specimens of the insect group are the first factor in analysis and research, the morphological characteristics that the researcher believes should be considered are the second factor, and the collected locations of the specimens in this insect group are the third factor. The three factors belong to three different considered sets. (2) The results of the research that the researcher believes are most appropriate can be obtained only after step-by-step analysis. This is actually a ’greedy’ process. Because matroid theory builds a good platform for greedy algorithms, we can conclude that the known knowledge structure of the researcher related to the research content constitutes a matroidal structure. (3) The approximate inference process of the researcher is also that of approximate inference to unknown knowledge from known knowledge; that is, the lower and upper approximations of the rough set are used to express the unknown knowledge. By (1) and (2), it is necessary to establish a matroidal structure over three sets. Combining (2) and (3), we conclude that it is necessary to study the lower and upper approximation operators of rough sets based on a matroidal structure over three sets. In [36], to describe the motivation of the study, an example given in Section 1, of a disease diagnosis decision-making problem in a clinic, illustrates a relation-based rough set model over three universes for realistic decision-making problems. We will look at this problem from the perspective of covering rough set models over three sets. Since each disease must show many basic symptoms and some concrete results of clinical examination, the known knowledge of the doctor is a set consisting of three parts for a disease d: BS is the set of basic symptoms of d, CE is the set of concrete clinical examination results, and D is {d}. The doctor will compare the basic symptoms and the results of the clinical examination of the patient to known diseases and analyze them to finally determine the most likely disease through the approximate inference method. The known knowledge of the doctor relative to his or her known diseases consists of three parts: {BS∣BS is relative to a disease d}, {CE∣CE is relative to a disease d}, and {D∣D is a disease d}. The process of comparative analysis by the doctor determines the optimal solution from the knowledge base of the doctor with respect to the diseases that are closest to that of the patient. This process is greedy. Combined with matroid theory, which provides a good platform for greedy algorithms, the structure of the known knowledge of the doctor is related to a matroidal structure. Approximate inference is the doctor’s representation of unknown knowledge with his or her known knowledge relative to diseases, which is an approximate representation of a rough set. We should note that if the doctor’s known knowledge base with respect to diseases does not completely cover the patient’s symptoms and clinical examination results, the inference process must be absolutely approximate. For instance, when COVID-19 first broke out in 2019, no doctor in the world had known knowledge that covered this new disease; only approximative knowledge was available to make inferences regarding this new disease. This type of inference finds an optimal solution from the doctor’s known knowledge base with respect to diseases; that is, it is a greedy inference. Therefore, this new disease was called unexplained pneumonia at the time, although doctors now have knowledge of this disease and some ways to treat it. Hence, it is necessary to discuss rough sets as well as covering rough sets based on a matroidal structure over three sets. As Ytow et al. [66] discussed, biological classification has an intimate relation to rough set theory. We note that both biological classification and doctors’ decision-making are considered in three-dimensional space. Additionally, mining valuable information from an information system expressed in three parts is already being explored by many researchers, such as in [36, 67, 68]. The real world is in three-dimensional space. The human cognitive process moves from lower dimensions to higher dimensions, from one-dimensional to two-dimensional space and then to three-dimensional space. Rough set theory is one of the methods by which human beings understand the world. Constructing a covering rough set model over three universes, or three-dimensional space, has become an urgent task. Completing this work is exactly in line with patterns of human cognition. Additionally, many problems in rough set theory are NP-hard, so solving these problems is often greedy; that is, greedy algorithms often need to be used, equivalently to say, matroid theory often need to be used. Hence, it is necessary to build up a matroidal structure on three-dimensional space, i.e., on the Cartesian product of three sets. Using this new matroidal structure, it is also necessary to construct approximation operators in rough set theory that are expressed in ternary form. For this purpose, we present the following contributions: For every structure and some of the definitions and properties presented in this paper, corresponding explanations are given through examples, where the information tables come from biological information systems. First, we present a matroidal structure over three sets—TP-matroid—and demonstrate that TP-matroid is an extension of Whitney’s classical matroid [56-58] under the idea of isomorphisms. Considering approximations of rough sets in knowledge spaces [69] with approximations in covering rough sets [11], we provide a pair of lower and upper approximations using the set of feasible sets of a TP-matroid. Second, with the help of formal concept analysis, we explore a pair of lower and upper approximations expressed in ternary form over three sets. Furthermore, we construct two TP-matroids by using this pair of lower and upper approximations. The integration of the two pairs of approximations in this paper is also discussed. There are two research goals of this paper: one is to theoretically study rough sets, aided by matroid theory over three sets, and the other is for the results provided here to be used in actual practice; we provide some examples with practical information systems. The rest of this paper is organized as follows: In Section 2, we review some basic definitions and properties of matroids, formal concept analysis, and rough sets. In Section 3, we first provide a matroidal structure over three sets with ternary form, i.e., a TP-matroid, and determine how to find rough set approximations over three sets with a precovering TP-matroid. In Section 4, for information data relative to formal contexts over three sets, we provide a pair of lower and upper approximations expressed in ternary form with the help of formal concept analysis. Using this pair of approximations, two TP-matroids are built. Concluding remarks are given in the last section.

Some notions and properties

Below, we review some basic notions used in this paper. For more details, matroid theory is referred to in [57, 58], formal concept analysis is seen in [70], semiconcepts are seen in [71], poset theory is referred to in [72], and rough sets are seen in [1, 2]. Since a data table is finite in practice, we assume that all of the discussions are finite in this paper.

Some notations

Let U, V and W be three sets. Then we will use the following notations in this paper for , and . |X| stands for the cardinality of . , and . , and . (X1,Y1,Z1) ∪ (X2,Y2,Z2) := (X1 ∪ X2,Y1 ∪ Y2,Z1 ∪ Z2). (X1,Y1,Z1) ∩ (X2,Y2,Z2) :⇔ (X1 ∩ X2,Y1 ∩ Y2,Z1 ∩ Z2). (X1,Y1,Z1) ∖ (X2,Y2,Z2) :⇔ (X1 ∖ X2,Y1 ∖ Y2,Z1 ∖ Z2). |(X,Y,Z)| := |X| + |Y | + |Z|, that is, the cardinality of (X,Y,Z). (X1,Y1,Z1) ⊔ (X2,Y2,Z2) := (X1 ∪ X2,Y1 ∩ Y2,Z1 ∪ Z2). 2 represents the power set of a set S. “E is in unary (binary; ternary) form” means: E := X(E := (X,Y );E := (X,Y,Z)), where . If there is a bijection , then we say U and V are isomorphic, denoted as U≅V. A ‘universe’ is a nonempty set. The Cartesian product of one set (two sets; three sets) U(U,V ;U,V,W) is U(U × V ;U × V × W) and is called one- (two-; three-) dimensional space.

Remark 1

We sometimes write y for {y} if y is a singleton set.

Matroid

Definition 1

[57, p.7][58, p.7] A matroid M is a set S and a collection of subsets of S (called independent sets) such that (i1)-(i3) are satisfied. . and . and for some y ∈ Y ∖ X. [57, p.11][58, p.9] Two matroids M1 and M2 on S1 and S2 respectively are isomorphic if there is a bijection that preserves independence. We write M1≅M2 if M1 and M2 are isomorphic.

Formal concept analysis

Formal concept analysis (or a concept lattice), proposed by Wille [73], is a useful and successful tool for dealing with data represented by a kind of information table—a formal context. It is well known that many data tables are similar in form to formal contexts. Hence, to study rough sets and matroids, formal concept analysis is a good tool [18–20, 26, 35, 69]. Next, we review some definitions and lemmas for formal concept analysis.

Definition 2

[70, pp.17-18] A formal context is a set structure such that O and P are nonempty sets and ; the elements of O and P are called objects and attributes, respectively, and gIm is (g,m) ∈ I. The derivation operators of are defined as follows : for all g ∈ X} and for all m ∈ Y }. [71] In a formal context , a pair (X,Y ) with and is called a ⊓-semiconcept if . Dually, a pair (C,D) with and is called a ⊔-semiconcept if .

Lemma 1

[70, p.19] The two derivation operators in a formal context satisfy the following condition for any (or ) where j ∈ J and J is an index set: .

Remark 2

For a formal context , if x ∈ O (or x ∈ P), then is abbreviated as . We can easily find that the family of ⊓-semiconcepts has the dual property of that of the family of ⊔-semiconcepts. Hence, we only consider the family of ⊔-semiconcepts and simply use semiconcept instead of ⊔-semiconcept in what follows. All semiconcepts in a formal context are denoted as .

Posets and equivalence relations

Definition 3

[58, p.45] A poset is a set S together with a binary relation ≤, i.e., a partial order, such that the following properties hold for ∀x,y,z ∈ S:

Definition 4

[72, pp.2-3] A binary relation ε on a nonempty set A is called an equivalence relation if it satisfies the following three properties for ∀a,b,c ∈ A: (a,a) ∈ ε. (a,b) ∈ ε ⇒ (b,a) ∈ ε. (a,b) ∈ ε and (b,c) ∈ ε ⇒ (a,c) ∈ ε.

Rough set

Definition 5

[1,2] Let U be a universe, be an equivalence relation on U, and [x] denote the equivalence class involving the element x. For any , we call and , the lower and upper approximations of X about the Pawlak approximation space(U,R), respectively. Let U/R = {[x]∣x ∈ U}. Every element in U/R is called R-basic category. is called an R-definable if X is the union of some R-basic categories; otherwise, X is R-undefinable.

Lemma 2

[1,2] Let (U,R) be a Pawlak approximation space. The lower and upper approximations can be described by the following an equivalent form: . is R-definable .

Definition 6

[11] Let U be a universe, and be a family of subsets of U. If no subsets in are empty and , then is called a covering of U. is called a covering approximation space. [74] Let Q be a universe. A knowledge structure is denoted by a pair , where . The only special assumption about is that it must contain the empty set and the full set Q. Considering the definition of a covering approximation space in Definition 6, we can state that the expression of Definition 6(1) over three sets is given below, where at least one of U,V and W is a universe. Let be a family of subsets of (U,V,W); i.e., . If none of the subsets in is (∅,∅,∅) and , then is called a covering of (U,V,W). is called a covering approximation space. In the coming Example 2 in Section 3, we will see that is a covering of (U,V,W). We generalize the definition of a knowledge structure in Definition 6 from one set to three sets.

Definition 7

Let U,V and W be three sets such that at least one of U,V and W is a universe. Let and . Then, is called a knowledge space and is called basic knowledge. Comparing Definition 6(2) with Definition 7, we see that Definition 7 is a generalization of Definition 6(2) since need not satisfy in Definition 7, but the corresponding condition is included in Definition 6(2). Yao et al. [11] pointed out that when generalizing Pawlak’s approximations, one task is to specify a subset of these properties that new approximation operators are required to preserve. Hence, according to Pawlak’s approximations, Yao et al. [11] and Yao [74] presented generalized definitions for lower and upper approximation operators, respectively. Considering Definitions 5, 6 and 7, Lemma 2, and the discussion in [11, 74] with the expression of approximations for knowledge spaces in [69], we can present the following definition:

Definition 8

Let S be a universe. Suppose that is a knowledge space in which and . Then, and , where , are a pair of lower and upper approximations on 2 if and only if and satisfy the following conditions with a partial order ≤ defined on 2 for any : , .

Rough set approximations produced by a new matroidal structure—TP-matroid

To combine rough sets and matroids, we first need to generalize the construction of matroids from one set to three sets, in particular, three universes. Then, we can explore rough set approximations with the new matroidal structure.

Relationships between TP-matroids and matroids

We generalize the definition of a matroid from one set to three sets.

Definition 9

Let U, V and W be three sets such that at least one of U,V and W is not empty. Let ; i.e, we have a collection of subsets of U × V × W (called feasible sets) such that (I1)-(I3) are satisfied for . . . Let . If at least one of X2,Y2 and Z2 is not empty, and |(X1,Y1,Z1)| < |(X2,Y2,Z2)|, then holds for some (x2,y2,z2) ∈ (X2,Y2,Z2) ∖ (X1,Y1,Z1) such that at least one of x2,y2 and z2 is not empty. Then, is called a three-partial matroid, abbreviated as TP-matroid. Let be a TP-matroid. If satisfies and , then is called a precovering TP-matroid. Two TP-matroids and are isomorphic if there is a bijection that preserves feasibility. We write if and are isomorphic. Characteristics of stridulatory files

Remark 3

Let U be a set of collected insect specimens, V be a set of considered morphological characteristics, and W be a set of locations of the collected specimens in U. Let be a TP-matroid, and let . Suppose and . Biologists will consider the common characteristics Y of when they analyze the set X of specimens during classification. Then, will imply if Yis the set of common characteristics of X(j = 1,2). This follows from . That is, the order ‘’ in (I2) is reasonable in some practical cases. We first explain some terms in Definition 9. A matroid M in Definition 1 is defined on one set. That is, the background set of M consists of one ‘part’. The background set of a TP-matroid in Definition 9 consists of three ‘parts’—-U,V and W. In other words, is an extension of the matroid from one set to three sets. Hence, is called a three-partial matroid or simply a TP-matroid. Definition 6(1) and Example 2 below show that for a TP-matroid , even if , may not be a covering of (U,V,W) since holds for some TP-matroids, such as that in Example 2. However, is a covering of (U,V,W) if . Therefore, it is suitable to call this a ‘precovering’ TP-matroid as described in Definition 9(2). Comparing items (1) and (2) in Definition 9, we assert that the structure of a precovering TP-matroid is a special case of the structure of the TP-matroid. We will analyze the existence of (x2,y2,z2) such that at least one of x2,y2 and z2 is not empty if |(X1,Y1,Z1)| < |(X2,Y2,Z2)| in (I3). Let U, V and W be three sets such that one of U,V and W is a universe. Suppose that satisfy the requirement that at least one of X2,Y2 and Z2 is nonempty. Then, we confirm that: |(X1,Y1,Z1)| < |(X2,Y2,Z2)|⇒∃(x2,y2,z2) ∈ (X2,Y2,Z2) ∖ (X1,Y1,Z1), where at least one of x2,y2 and z2 is not empty. The reason for this is as follows: We know that |(X,Y,Z)| = |X| + |Y| + |Z|(j = 1,2). Since at least one of X2,Y2 and Z2 is not empty, this implies |(X2,Y2,Z2)|≠ 0. Therefore, |X2|≠ 0,|Y2|≠ 0 and |Z2|≠ 0 hold. If |(X1,Y1,Z1)| < |(X2,Y2,Z2)|, we assert that one of |X1| < |X2|, |Y1| < |Y2| and |Z1| < |Z2| holds. If this assertion is not true, then |X2|≤|X1|, |Y2|≤|Y1| and |Z2|≤|Z1|. This implies |X2| + |Y2| + |Z2|≤|X1| + |Y1| + |Z1|, a contradiction of the known condition |(X1,Y1,Z1)| < |(X2,Y2,Z2)|. Suppose |X1| < |X2|≠ 0. Then there is an x2 ∈ X2 ∖ X1≠∅ satisfying x2≠∅. Therefore, (x2,∅,∅) ∈ (X2,Y2,Z2) ∖ (X1,Y1,Z1) holds, and (x2,∅,∅)≠(∅,∅,∅) is correct. Similarly, for |Y1| < |Y2|≠ 0 or |Z1| < |Z2|≠ 0, the needed results are correct. The following example shows the existence of a TP-matroid.

Example 1

Table 1 is an expression of some biological information in [75, Table 4].

Table 1

Characteristics of stridulatory files

Specimen	The number of teeth in the distal part	The number of teeth in the proximate part	Source/specimen, origin (scanning electron microscope, SEM)
Japonica 1	4(9)	61	Kim (2009): Korea, SEM
Japonica 2	6	57-60	CH7421-2: Korea (n = 2)
Japonica 3	6	66	Wu (2010): China

Table 4

Some features of stridulatory files

Specimen	The number of teeth in the distal part	The number of teeth in the proximate part	Source/specimen, origin (scanning electron microscope, SEM)
Japonica 1	4(9)	61	Kim (2009): Korea, SEM
Japonica 2	6	57-60	CH7421-2: Korea (n = 2)
Neochlora 1	10	66	Shi et al. (2003): China,SEM
Neochlora 2	5	72	Shi et al. (2003): China,SEM
Neochlora 3	7	68	CH7670: China
Antipoda sp. nov. 1	12	45	CH4147: Australia
Antipoda sp. nov. 2	12	51	CH4148: Australia

Let a := japonicaj,(j = 1,2,3), b1 :=‘The number of teeth in the distal part’, b2 :=‘The number of teeth in the proximate part’, c1 :=‘Korea’, and c2 :=‘China’. Then, we obtain the mathematical expression of Tables 1 in Table 2.

Table 2

Mathematical expression of Table 1

	b₁	b₂
a₁	4(9)	61	c₁
a₂	6	57-60	c₁
a₃	6	66	c₂

Mathematical expression of Table 1 Let U = {a1,a2,a3},V = {b1,b2} and W = {c1,c2}. Let ∪{(a,b1,∅),(a,b1,c1),(a,{b1,b2},∅),(a,{b1,b2},c1),(j = 1,2)}∪ {(∅,b1,∅),(∅,b1,c1),(∅,{b1,b2},∅),(∅,{b1,b2},c1)}∪{({a1,a2},b1,∅),({a1,a2},{b1,b2},∅),({a1,a2},{b1,b2},c1)}. Then, we may easily check that satisfies (I1)-(I3). Therefore, using Definition 9(1), we find that is a TP-matroid. Here, for and , means that in researching the japonica population with the biological information shown in Table 1, one of the basic knowledge items of the biologists is that X, the japonica that comes from location Z, must have characteristics Y. For example, means that the biologist believes the japonica collected in c1 must possess the common characteristic b1. Simply, we denote as {(X,Y,Z),γ ∈Υ}. We find that satisfies and . Considering Definition 6(1) and Definition 9(2), we may easily confirm that The next example will show the existence of a precovering TP-matroid. , i.e., , since , is not a covering of (U,V,W). is not a precovering TP-matroid. We also see that the set of specimens of the insect group is U = {a1,a2,a3}, the set of morphological characteristics that the biologist believes need to be considered is V = {b1,b2}, and the set of locations of the collected specimens is W = {c1,c2}. The available knowledge of the biologist in Example 1 is .

Example 2

Let U,V and W be given as in Example 1. Let {(∅,{b1,b2},c),k = 1,2}∪{(∅,b,∅),j = 1,2}∪{(∅,{b1,b2},∅)}∪{(a,b,∅),i = 1,2,3;j = 1,2}∪{(a,{b1,b2},∅),i = 1,2,3}∪{(a,∅,c),i = 1,2,3;k = 1,2}∪{(∅,∅,c),k = 1,2}∪{(a,∅,∅),i = 1,2,3}∪{(∅,∅,∅)}. Then we may easily find that is a TP-matroid by Definition 9(1). is a covering of (U,V,W) since satisfies using Definition 6(1). is a precovering TP-matroid since by Definition 9(2). The known knowledge of the biologist in Example 2 is on U × V × W.

Remark 4

We next compare the definitions of a matroid and TP-matroid. I) The comparisons of the structures between the two definitions are shown in Table 3.

Table 3

Compare the structures between a matroid and a TP-matroid

	dimension of ground set	range of family of independent(feasible) set	restricted conditions
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$(S,\mathcal {I})$\end{document}(S,I), a matroid	one	2^S	(i1)-(i3)
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$(U\times V\times W, \mathcal {T}\mathcal {I})$\end{document}(U×V×W,TI), a TP-matroid	three	2^U × 2^V × 2^W	(I1)-(I3)

Compare the structures between a matroid and a TP-matroid Using Table 3, we find results (1) and (2). Let S = U × V × W. Then, is not a matroid if is a TP-matroid, since according to Definition 9(1). If is a matroid, then . It is easy to see that 2≠ 2 × 2 × 2 in general. For instance, in Example 1, implies 2≠ 2 × 2 × 2. That is, the range of the family of independent sets of a matroid and that of the family of feasible sets of a TP-matroid are different in general. We next compare some relations between the restricted conditions of the matroid and TP-matroid. (i1) means that . Therefore, it follows that . Considering Example 1, we know for some TP-matroid. This indicates that (I1) cannot determine . It only confirms that . Hence, (i1) is a special case of (I1). Conditions (i3) and (I3) have some similarity. The similarity suggests that there is a close relation between the matroid and TP-matroid. Let be defined as in Example 1. Let . We know that is a matroid using Definition 1(1). Let , and . Then, we consider the following two cases: In one case, (*1) (i2) is correct for . If (I2) holds for , then holds, which contradicts . Thus, (*1) implies that (i2) cannot be replaced by (I2). In the other case, (*2) (I2) is correct for . If (i2) holds for , then holds, which contradicts . Hence, (*2) means that (I2) cannot be replaced by (i2). The above two cases show that (i2) and (I2) are independent. II) To continue the discussion of the definitions of matroid and TP-matroid, we can obtain more results for their relations as follows in (3)-(6). We may easily prove V ≅∅× V ×∅. We can also easily demonstrate to be a matroid such that M1≅M2. Suppose that every matroid is a TP-matroid. Then, M1 is a TP-matroid. In fact, we know that M1 is not a TP-matroid since . Hence, even under isomorphisms of sets and matroids, M2 is not a TP-matroid. In other words, a matroid may not be a TP-matroid even up to isomorphism. Suppose that every TP-matroid is a matroid. From Example 1, we know , where is defined as in Example 1. By (i2), we obtain ; in particular, which contradicts Example 1. Thus, not every TP-matroid is a matroid. The above items (3) and (4) imply that the TP-matroid is a new structure that is different from the matroid. Let and , where U and W are defined as in Example 1. Using Definition 1(1), M21 and M23 are matroids. Let M = (M21,M2,M23), i.e., . Then, we obtain that M is not a TP-matroid since . This result indicates that the TP-matroid is not a combination of three matroids. It is a new matroidal structure over three sets.

Remark 5

If is the family of feasible sets of a TP-matroid , then can be seen as a knowledge space by Definition 7 with as the family of basic knowledge. Examples 1 and 2 indicate that in biology, some known knowledge on U × V × W may be used to construct the family of feasible sets of a TP-matroid , where in Example 1 and in Example 2, respectively. Xu et al. [69] depicted a knowledge space for one universe as one of two types of knowledge structures is a knowledge space and closed under set union. Hence, to extend the rough set model of a knowledge space from one universe to three universes, the known knowledge should have a property similar to being closed under set union. Hence, we give the following definition.

Definition 10

Let U,V and W be three sets such that at least one of U,V and W is a universe and . If satisfy , then, is called ⊔-closed.

Remark 6

Let be as in Example 1. Using Definition 10, we may easily show that is ⊔-closed, although is not a covering of (U,V,W) as shown in Example 1. Let be as in Example 2. Using Definition 10, we know that is not ⊔-closed since , although is a covering of (U,V,W) as shown in Example 2. (1) and (2) above imply that the definition of ⊔-closed is independent from that of covering. We will continue to discuss some relationships between matroids and TP-matroids.

Lemma 3

Let U be a universe. If is a TP-matroid, then is a matroid in which . Let be a matroid and U≠∅. If , then is a TP-matroid. The first property of Lemma 3 can be easily verified by Definition 1(1). The second property can be easily proven by Definition 9. These proofs are omitted. Here, we stress the fact that as given in Example 1 is ⊔-closed, and as given in Example 2 is not ⊔-closed. This fact implies that the family of feasible sets of a TP-matroid cannot always have the property of being ⊔-closed. Combined with in Lemma 3(2), we believe the family of independent sets of a matroid does not always have the property of being ⊔-closed; that is, is not ∪-closed. This result is the same as in the discussion of in classical matroid theory [57, 58]. It also hints that there is an intimate relation between matroids and TP-matroids. Using Lemma 3, we may easily obtain since . Furthermore, we obtain the following lemma.

Lemma 4

Let U be a universe (j = 1,2,3,4). Let and be two TP-matroids satisfying . Then, holds. Let and be two matroids such that . Then, holds. Lemma 4 can be easily verified with Definitions 1(2) and 9(3) and Lemma 3. The proof is omitted.

Remark 7

Lemma 3 implies that a matroid on a universe U corresponds to a TP-matroid on U ×∅×∅, and every TP-matroid on U ×∅×∅ corresponds to a matroid on U. Lemma 4 implies that under isomorphism, the correspondences are unique. Combining Lemmas 3 and 4, we may obtain the following theorem.

Theorem 1

The correspondence between a matroid and a TP-matroid is a bijection between is a nonempty set} and is a nonempty set} up to isomorphism for matroids and up to isomorphism for TP-matroids.

Remark 8

Since the structure of a TP-matroid is only a special kind of TP-matroid, combining this expression and Theorem 1, we determine that under isomorphism of matroids and isomorphism of TP-matroids, the definition of a TP-matroid is a generalization of the definition of a matroid. Hence, the TP-matroid is a matroidal structure over three sets.

Approximations generalized by TP-matroids

Section 3.1 generalizes the definition of a matroid from one set to three sets. Examples 1 and 2 imply that sometimes, the basic knowledge of some researchers is constructed by the feasible sets of a TP-matroid. In addition, some problems are solved by some matroidal structures [61–65, 76, 77]. Hence, we hope to solve some problems with the new matroidal structure—the TP-matroid. We note that utilizing a set of basic knowledge (or known knowledge) to infer unknown knowledge is a good and natural strategy. In fact, this inference corresponds to rough set theory. Using matroidal structures has already yielded many results on rough sets, and vice versa. Hence, it is necessary to explore the central content of rough sets——approximation operations——with the assistance of TP-matroids. Considering Definitions 5, 6 and 7, Lemma 2 and Remark 6(1), we provide the following definitions.

Definition 11

Let be a TP-matroid. Let . . or or Z ∩ C≠∅}. ; If one of A,B and C is empty, then define . If any of A,B and C is not empty, then define .

Remark 9

We now analyze Definition 11. Let be a TP-matroid. We analyze items (1) and (3) in Definition 11 as follows. By Definition 9(1), satisfies (I1) and (I2). From (I1), we can suppose . Then, and (I2) together imply . In addition, holds for any . This means that (∅,V,∅) ∈ low(A,B,C). Therefore, low(A,B,C)≠∅ holds. This implies that the definition of is well defined. We analyze items (2) and (4) in Definition 11 as follows. By Definition 3, we may easily obtain that , is a poset with (U,∅,W) as the maximum element. As a generalization of the upper approximation expressed in Lemma 2, we define for if one of X,Y and Z is empty, in particular, if X = U,Y = ∅ and Z = W. Hence, is reasonable in Definition 11 if one of X,Y and Z is empty. Because we have “A≠∅,B≠∅,C≠∅” ⇒ “B ∩ V ≠∅ since ”, and , we obtain (∅,V,∅) ∈ upr(A,B,C) if any of A,B and C is not empty. This means that is well defined for the case of A≠∅,B≠∅ and C≠∅. Let U be the set of collected insect specimens of a group, V be the set of considered morphological characteristics, and W be the set of sources of collected specimens in U. Let U = ∅. This means that a specimen could not be obtained, so no insect specimens were collected for research. This case is not valuable for biologists to research. If V = ∅. This means that there are no morphological characteristics to be considered for the collected specimens. This will not occur in biological research, since any specimen must possess some morphological characteristics to be considered. If W = ∅. This means that the sources of all the collected insect specimens in U are unknown. However, biologists generally know where the researched specimens were collected from. Even in special cases in which the source of a specimen is unknown, biologists will try to infer the source of the specimen. Hence, W≠∅ holds if U≠∅. The above analysis shows that U≠∅,V ≠∅ and W≠∅ generally hold in scientific research. In addition, if is considered by biologists, then in general, we have A≠∅,B≠∅ and C≠∅. Hence, if any of A,B and C is not empty for , then biologists infer the properties of (A,B,C) using their known knowledge , for example, known specimens, known morphological characteristics or known locations, to approximate (A,B,C). This implies that the supposition of X ∩ A≠∅ or Y ∩ B≠∅ or Z ∩ C≠∅ in upr(A,B,C) is reasonable. Furthermore, is effective.

Lemma 5

Let be a precovering TP-matroid. Then, upr(A,B,C)≠∅ holds for such that one of A,B and C is not empty. The proof of Lemma 5 can be found in the Appendix.

Remark 10

We analyze the supposition in Lemma 5 on the basis of biological ideas. Let U be a set of collected insect specimens of a group, V be the set of the considered morphological characteristics, and W be the set of locations of the collected specimens in U. Let be a TP-matroid, and let . Using the set of basic biological knowledge to approximate (A,B,C) is a common method in biological research. If A = B = C = ∅, then according to the discussion in Remark 9(2), this case is not valuable for biologists. Therefore, we assume that at least one of A,B and C is not empty. That is, biologists pay much more attention to . If A ∩ X = B ∩ Y = C ∩ Z = ∅ for any , then no known knowledge exists in to infer the properties of (A,B,C). During actual biological research, some known knowledge generally exists to infer the properties of (A,B,C), or approximate (A,B,C). Hence, should be precovering. That is, the supposition of the precovering of in Lemma 5 is suitable for biological research and more generally for research in real life. We explore some properties of and as characterized in Definition 11 to decide whether and are a pair of lower and upper approximations defined on 2 × 2 × 2 according to Definition 8.

Lemma 6

Let be a TP-matroid. Let and be given as in Definition 11. Then, the following statements are correct for . If one of A,B and C is empty, then holds. Let be precovering. If any of A,B and C is not empty, then holds. If any of A,B and C is not empty and is precovering, then holds. If and (A,B,C) = (U,∅,W), then holds. If is ⊔-closed, then holds. . . The proof of Lemma 6 can be found in the Appendix.

Remark 11

Let U be the set of collected insect specimens in a group, V be the set of considered morphological characteristics, and W be the set of the sources of collected specimens in U. Let . Let be the set of common morphological characteristics for any x ∈ X(j = 1,2). Then, the common morphological characteristics of X1 ∪ X2 must be contained in Y1 ∩ Y2. With the increase in the number of locations, the chance of collecting specimens will increase. Thus, for , the definition of (X1,Y1,Z1) ⊔ (X2,Y2,Z2) = (X1 ∪ X2,Y1 ∩ Y2,Z1 ∪ Z2) is useful in biology. Furthermore, the restricted condition of is ⊔-closed is similar to some ideas in biology. Hence, the supposition that is ⊔-closed in Lemma 6(5) is in line with typical biological ideas. We next use an example to illustrate Definition 11 and Lemma 6.

Example 3

Let be as given in Example 1. Let A = {a3},B = {b2} and C = {c1}. Then by Definition 11, we obtain the following results: . j = 1,2}∪{(∅,{b1,b2},c1), (∅,{b1,b2},∅),({a1,a2},{b1,b2},∅),({a1,a2},{b1,b2},c1)}. (a,{b1,b2},c1),(j = 1,2)}∪{(∅,b1,c1),(∅,{b1, b2},c1),({a1,a2},b1,c1),({a1,a2},{b1,b2},c1)}. Hence, we obtain , and furthermore, . In addition, using Definition 11, we obtain low(A,B,C) = {(∅,{b1,b2},∅),(∅,{b1,b2},c1)} and therefore . We have the following results from the above discussions: (**1) . (**2) since . Result (**1) implies the correctness of Lemma 6(6). Result (**2) shows that if none of A,B and C are empty, this does not imply that . Using Example 1, we know that is not precovering. Hence, the supposition that is precovering is necessary to obtain the consequences in Lemma 6(2) and Lemma 6(3). Let A1 = {a1,a2},B1 = ∅ and C1 = c1, where a1,a2 and c1 are defined as in Example 2. Then, we obtain the following result: (**3) . Combined with Example 2, we know that is not ⊔-closed. The above result (**3) implies that the condition is necessary for the consequence of (A,B,C) to be feasible in Lemma 6(5). We next perform an analysis combining Example 1 and Example 2 with Example 3. Because the evolution of natural history is impossible to repeat, entomologists often use their known entomological knowledge to infer unknown content in their own research. Such inference is helpful for studying the distribution of insect populations, the formation of historical developments, and so on. It is particularly important for the targeted collection of specimens. For instance, in Table 2, a3, i.e., the specimen japonica 3, is collected in c2, i.e., China. Since the Korean Peninsula, to which Korea belongs, and China are connected by land, the entomologists in Example 1 and Example 2 hypothesize that if a3 is collected in c1, i.e., Korea, it may also have the characteristic b2 that it currently has. This is represented by the set (A = a3,B = b2,C = c1) in Example 3. We will see that (1) using his or her known knowledge , the entomologist in Example 1 obtains the pessimistic result of the hypothesis as (∅,{b1,b2},c1) and the optimistic result as (∅,b2,c1) (see Example 3). Both the first coordinates of and are ∅; that is, both of the corresponding sets of specimens of and are ∅. This means that no conjectured specimens will appear. Therefore, this entomologist will not go to Korea, i.e., c1, to collect the specimen according to his or her hypothesis. (2) Using his or her known knowledge , the entomologist in Example 2 obtains the pessimistic result of the hypothesis as (A,B,C) and the optimistic result as (A,B,C). In other words, theoretically, he or she is convinced that the hypothesis is correct. (3) From Example 2, we find . That is, the known knowledge of the entomologist in Example 2 completely covers (A,B,C), but that of the entomologist in Example 1 does not since holds in Example 1. This leads to the different conclusions of the two entomologists regarding the same hypothesis. In fact, and imply that the conclusion of the entomologist in Example 2 is more correct than that of the entomologist in Example 1. Therefore, the hypothesis should be true. (4) In fact, a similar analysis can be done for sets that can be represented in a ternary form (X,Y,Z), where and the three sets U,V and W are as given in Example 1. (5) Rough sets, an intelligent theory, are an effective tool for intelligent computing. (1)-(4) above show that the method proposed here, i.e., rough set approximation based on the TP-matroidal structure, is helpful and usable for the study of insect systematics, which includes the classification of insects. This also shows a practical application of the rough sets provided in this paper. Therefore, it is necessary to further discuss the rough set approximations provided here.

Example 4

Let be a TP-matroid with U≠∅ or W≠∅. Let . Then, we obtain for any since and (I2) holds. In particular, we obtain . That is, is the family of all subsets of (U,V,W). Thus, it is easy to see that is ⊔-closed and is precovering. We may easily obtain . We also see that and . Therefore, we have since one of U and W is not empty.

Remark 12

On the one hand, Example 4 examines the correctness of Lemma 6(4). On the other hand, Example 4 shows that if one of A,B and C is empty in a precovering TP-matroid such that is ⊔-closed, then we cannot confirm even if . By Definition 8 with the relationships between a covering and the feasible sets of a precovering TP-matroid, we obtain the following theorem by Lemmas 5 and 6.

Theorem 2

Let be a precovering TP-matroid and be ⊔-closed. Let satisfy A≠∅,B≠∅ and C≠∅. Then, . . Using items (2) and (6) in Lemma 6, the proof of item (1) is straightforward. The proof of Theorem 2(2) can be found in the Appendix. Using Theorem 2 and Definition 8, we find that and are indeed a pair of rough set approximations based on a precovering TP-matroid with a family of feasible sets that is ⊔-closed. In what follows, we describe how to acquire information from and in Algorithms 1 and 2, respectively. In Algorithms 1 and 2, we need to visit n feasible sets; that is, the complexity of Algorithm 1 is O(n), as is that of Algorithm 2. Acquiring lower approximation based on a precovering TP-matroid. Acquiring upper approximation based on a precovering TP-matroid.

Remark 13

From Definition 8 and Theorem 2, we can find that and are the lower and upper approximations generated by the family of feasible sets of a precovering TP-matroid such that is ⊔-closed. Considering Remark 11, we know that the definition of ⊔-closed for is in line with common ideas. In real cases, biologists and other researchers consider satisfying A≠∅,B≠∅ and C≠∅. Hence, the suppositions in Theorem 2 are valuable according to the ideas of biologists and other researchers. The outline of the process of searching the lower and upper approximations generated by a TP-matroid in this subsection is shown in Fig. 1.

Fig. 1

Diagram of searching for lower and upper approximations from TP-matroids

Diagram of searching for lower and upper approximations from TP-matroids The process in this section is as follows: , a TP-matroid , a pair of operators relative to the approximations, where . , a precovering TP-matroid, and , a ⊔-closed family , a pair of approximation operators, where and A≠∅,B≠∅,C≠∅. The converse of the above process is considered in the next section.

Approximations related to formal contexts

It is necessary to find matroidal structures with rough sets. This work has been done for a single universe, such as in [17]. The TP-matroid is established over three sets in Section 3, and determining how to build constructions of TP-matroids with rough set theory is now the task that we face. Using rough set theory, the first step of this work is to set up a pair of approximation operators. According to Definitions 5, 6, 7 and 8, the pair of approximation operators is based on a family of basic knowledge. We know that rough set theory and formal concept analysis are two important tools for dealing with data tables. This suggests that formal concept analysis may be helpful in our work. Therefore, in this section, we will construct TP-matroids with the help of some rough set approximations based on a kind of data table—a formal context. We provide some preliminary definitions.

Definition 12

Let U = U1 ∪ U2 ∪… ∪ U be a universe satisfying U≠∅ and U ∩ U = ∅(i≠j;i,j = 1,2,…,n). Let V = {b,j = 1,2,…,m} and W = {w,j = 1,2,…,n} be universes. Any two of U,V and W are disjoint. For every w, there is a formal context relative to w(j = 1,2,…,n). The derivation operators of are denoted as . Let and 1 ≤ s ≤ n; the derivation operators in the formal context are denoted as , respectively, where is defined as: for and y ∈ V, if x ∈ U satisfies xRy for some j ∈{i1,i2,…,i}.

Remark 14

Let and be as in Definition 12. U × V × W can be decomposed into n different spaces U × V × w(j = 1,2,…,n). In other words, U × V × W is a combination of n different spaces U × V × w(j = 1,2,…,n), where . We analyze the formal context given in Definition 12 as follows. For w ∈ W, there is one and only one formal context corresponding to w since U ∩ U = w ∩ w = ∅(i≠j;i,j = 1,2,…,n). If w≠w, then holds since U ∩ U = ∅ implies that is not defined for any x ∈ U(i≠j;i,j = 1,2,…,n). Furthermore, combining Lemma 1 and , we know that is not defined (i≠j;i,j = 1,2,…,n). Let . means that there is one and only one j ∈{i1,i2,…,i} such that xRy holds in the formal context , since U ∩ U = ∅(p≠q;p,q ∈{i1,…,i}). We will use an example to show the existence of the formal contexts in Definition 12.

Example 5

Table 4 shows some of the biological information in [75, Table 4]. Some features of stridulatory files Let a1 := japonica1,a2 := japonica2,a3 := neochlora1,a4 := neochlora2, a5 := neochlora3,a6 := antipodasp.nov.1,a7 := antipodasp.nov.2;b1 :=‘The number of teeth in the distal part’, b2 :=‘The number of teeth in the proximate part’; w1 := Korea,w2 := China, and w3 := Australia. Then, the mathematical expression of Table 4 is shown in Table 5.

Table 5

Mathematical expression of Table 4

	b₁	b₂
a₁	4(9)	61	w₁
a₂	6	57-60	w₁
a₃	10	66	w₂
a₄	5	72	w₂
a₅	7	68	w₂
a₆	12	45	w₃
a₇	12	51	w₃

Mathematical expression of Table 4 From Table 5, we can obtain T4, as shown in Table 6.

Table 6

A part T4 of Table 5

	b₁	b₂
a₁	4(9)	61
a₂	6	57-60
a₃	10	66
a₄	5	72
a₅	7	68
a₆	12	45
a₇	12	51

A part T4 of Table 5 Using Algorithm 2 from [78] on T4, we obtain a formal context , where is shown in Table 7.

Table 7

Formal context

	b₁	b₂
a₁	1	1
a₂	1	1
a₃	1	1
a₄	0	0
a₅	1	0
a₆	0	0
a₇	0	0

Formal context Combining Tables 5 and 7, we obtain the expression of Table 5 with the language related to the formal context; see Table 8.

Table 8

Formal context language’s expression corresponding to Table 5

	b₁	b₂
a₁	1	1	w₁
a₂	1	1	w₁
a₃	1	1	w₂
a₄	0	0	w₂
a₅	1	0	w₂
a₆	0	0	w₃
a₇	0	0	w₃

Formal context language’s expression corresponding to Table 5 In Tables 7 and 8, ‘1’ means that a has b, and ‘0’ means that a does not have b(i = 1,2,…,7;j = 1,2). Let U = {a,j = 1,2,…,7},V = {b1,b2} and W = {w1,w2,w3}. Then, based on w1,w2 and w3, we can obtain U1 = {a1,a2}, U2 = {a3,a4,a5} and U3 = {a6,a7}, respectively. Hence, we obtain the formal context corresponding to w from Table 8; see Tables 9, 10, and 11 (j = 1,2,3).

Table 9

Formal context

	b₁	b₂
a₁	1	1
a₂	1	1

Table 10

Formal context

	b₁	b₂
a₃	1	1
a₄	0	0
a₅	1	0

Table 11

Formal context

	b₁	b₂
a₆	0	0
a₇	0	0

Formal context Formal context Formal context It is easy to see that U = U1 ∪ U2 ∪ U3 = {a,j = 1,2,…,7}; U ∩ U = ∅(i≠j;i,j = 1,2,3). x ∈ U ⇔ there is a unique j satisfying x ∈ U for some j ∈{1,2,3}. . In addition, we may easily obtain , i.e., Table 12, such that for ∀x ∈ U and ∀y ∈ V, xR123y ⇔ xRy if x ∈ U for some j ∈{1,2,3}.

Table 12

Formal context

	b₁	b₂
a₁	1	1
a₂	1	1
a₃	1	1
a₄	0	0
a₅	1	0
a₆	0	0
a₇	0	0

Formal context

Remark 15

We can use any algorithm to change the information table expressed by T4 to a formal context and need not always use an algorithm such as the one in [78]. However, it is possible that the obtained formal context will not completely match Table 6. Even so, this does not affect the research method and results provided in this paper. Based on the source of the specimens, Table 4 can produce three formal contexts . In fact, biologists can discuss the relationships among specimens belonging to different locations to determine where their predecessors come from. Furthermore, it may be possible to find other biological content.

Lemma 7

Let U,V,W be given as in Definition 12. Then for any b ∈ V and . for any . The first property of Lemma 7 can be easily verified by Definitions 2 and 12. The second property can be easily verified by the combination of Lemma 1 and item (1). The proofs of these two items are omitted.

Lemma 8

Let U,V, and W be given as in Definition 11. In the formal context , where s ∈{1,2,…,n}, we define a relation on U as follows: . Then, is an equivalence on U. We use to denote a category in containing an element a ∈ U. Lemma 8 can be easily verified by Definition 4, and its proof is omitted. We will use an example to show Lemma 8.

Example 6

Let , and be defined as in Example 5. Using Definition 2(1) on and , we obtain , and . Combining Lemma 8, we obtain the following results: ; and ; .

Definition 13

Let U,U,V, W, and be defined as in Definition 12. In , is defined as in Lemma 8 for ∀a ∈ U(j = 1,…,n). Let . Let there are such that for some , some i0 ∈{1,…,n} and some l0 ∈{1,…,m}}. Let and . If , then and are denoted as and , respectively. We give the following definitions: . . If Low(A,B,C) = ∅, then define . If Low(A,B,C)≠∅, then define . If Upr(A,B,C) = ∅, then define . If Upr(A,B,C)≠∅, then define .

Remark 16

Let U,U,V,W, and be defined as in Definition 12. Let . Using Definition 13, we obtain the following: Low(A,B,C) = ∅ means that for any and every , there is an for any b ∈ B(j = 1,…,|C|). Combining Definition 2(1), we obtain . Therefore, holds by Lemma 7. Similarly, we find that and if Upr(A,B,C) = ∅. Clearly, (∅,V,∅) is the minimum element in the poset according to Definition 3 and the definition of . Hence, we define as reasonable in the case Low(A,B,C) = ∅ by means of the definition of the lower approximation operator in Yao [74]. (U,∅,W) is the maximum element in the poset by Definition 3 and the definition of . Hence, is reasonable in the case of Upr(A,B,C) = ∅ by the definition of the upper approximation operator in Yao [74]. We give an example of Definition 13 and Lemma 7.

Example 7

Let U(j = 1,2,3),U,V, and W be given as in Example 5. Let and . By Example 5, we know that a2 ∈ U1,a3 ∈ U2 and a6 ∈ U3. Since C = {w1,w2}, we only consider and , which are given in Example 5. In , we know that and . In , we know that and . Thus, we obtain . By Definition 13 and Example 6, we know that and . Therefore, we obtain and . In addition, and . Thus, t = 2 and δ = 2. Hence, we obtain the following: (◇1) For b1: . (◇2) For b2: . Furthermore, we obtain . In addition, we easily find that . Therefore, we have . Additionally, it is easy to see that , , and . Considering the above, we obtain the following: (◇◇1) For b1: . (◇◇2) For b2: . Furthermore, we obtain . In addition, we easily find that and . Hence, holds. From the above results, we can obtain the following: Low(A,B,C)≠∅ and Upr(A,B,C)≠∅. holds since , , and . .

Lemma 9

Let U,V, and W be given as in Definition 12. Let , , , be given as in Definition 13. Let be given as in Definition 12(2), and let δ be given as in Definition 13. Then, we can obtain the following results for any such that A≠∅, B≠∅, and C≠∅: Low(A,B,C)≠∅⇔ Upr(A,B,C)≠∅. Let and . Then, . If Low(A,B,C)≠∅, then and . . If , Low(A,B,C)≠∅, t = |B| and δ = |C|, then ; if δ = |C|. The proof of Lemma 9 can be found in the Appendix.

Remark 17

Let and be as in Lemma 9. Let . If B = ∅, then . This implies Low(A,B,C) = ∅ and Upr(A,B,C) = ∅. If C = ∅, then is not defined by Definitions 12 and 13. In biology research, A = ∅ means that no biological specimens are considered by biologists. B = ∅ means that no biological characteristics are considered by biologists. These two cases do not have any value for biological research. C = ∅ means that no locations of specimens are chosen. This has no value for biologists since W≠∅ and . Hence, in the suppositions of Lemma 9, we require A≠∅,B≠∅ and C≠∅.

Theorem 3

Let U(j = 1,…,n),U,V, and W = {w,j = 1,…,n} be as given in Definition 12. Let and , {α1,…,α}, be as given in Definition 13, and let be as given in Definition 12(2). If for any w ∈ W and satisfies for every b ∈ V (j ∈{1,…,n}), then the following statements are correct for with A≠∅,B≠∅ and C≠∅. . . The proof of Theorem 3 can be found in the Appendix. Considering Definition 8 and Theorem 3, we can determine that and are a pair of approximation operators. We give an example to explain Theorem 3.

Example 8

Let U1 = {a1,a2},U2 = {a3,a4,a5}, V = {b1,b2},W = {w1,w2} be given as in Example 5. Let A = {a1,a2,a3},B = {b1,b2} and C = {w1,w2}. Then by Example 7, we know the following: for w1: ; for w2: and . Considering the above, we obtain and b ∈ B satisfy (a1,b2,w1),(a2,b1,w1),(a2,b2,w1),(a3,b1,w2), (a5,b1,w2),(a3,b2,w2)}. Hence, we have and . Using Lemma 7(2), we have since α1 = 1 and α = 2. Hence, holds. Therefore, we confirm and . By Lemma 9(3), we obtain and . This means that .

Remark 18

We analyze Lemma 9 and Theorem 3. Considering Lemma 9(3), plays an important role in determining and . For a given , is known immediately. Furthermore, is found at the same time. Hence, finding and relies on finding . Combining items (4) and (5) in Lemma 9, we know that and can characterize the family of basic knowledge under some preconditions. Using Definitions 7 and 13 with Theorem 3, we can say that and are the lower and upper approximations with respect to formal contexts for at least one of X,Y and Z is ∅}. Therefore, under some preconditions on (U,V,W), we provide the lower and upper approximations in a ternary form to characterize . Using Definition 8 and Theorem 3, we can say that is the family of basic knowledge used to approximate for A≠∅,B≠∅ and C≠∅ with the rough set approximations and . Using Theorem 3 and Lemma 9, we can roughly say that the definitions of and in Definition 13 are the generalizations of lower and upper approximations in Definition 8 from one universe to three sets with respect to formal contexts. We can also roughly say that and generalize the rough set approximations in [50] from two sets to three sets with respect to the family of semiconcepts in formal contexts. Let U be the set of insect specimens of a group (j = 1,…,n), V be the set of morphological characteristics considered by biologists, and W be the set of sources of specimens in . By Lemma 9(2), Low(A,B,C) = ∅ implies that for every b ∈ B and any w ∈ C. This implies that no specimen in A has any of the considered morphological characteristics in B for every specimen location in C. In this case, biologists will change their ideas, such as by changing the set of considered morphological characteristics, since they hope to obtain the real phylogenetic relationships or other biological relationships among the specimens. This requires Low(A,B,C)≠∅. In a formal context , means that A is the set of objects having the attributes in B. In biology, if A is a set of insect specimens in a group and B is a set of morphological characteristics considered by biologists, then means that every specimen in A jointly has every morphological characteristic in B. That is, every specimen in A jointly has the set of ancestral morphological characteristics in B if the biologists are studying biological properties such as phylogenesis for A. This demonstrates the importance of discussing and of researching according to Theorem 3. Diagram of searching for TP-matroids in formal contexts In Section 3.2, we discuss how to construct a pair of approximation operators with the basic knowledge , which is the feasible set of a TP-matroid. Now, we consider the converse, i.e., how to establish a TP-matroid with respect to the rough set approximation operators and .

Theorem 4

Let U(j = 1,…,n),U,V,W and be described as in Definition 12, in which {α1,…,α} is as given in Definition 13. Let satisfy Low(A,B,C)≠∅. Define and . Then, and are two TP-matroids. Theorem 4 can be easily verified by combining Definition 9 and Definition 13, and its proof is omitted. We discuss some properties of the two TP-matroids given in Theorem 4.

Theorem 5

Let C)) and be given as in Theorem 4, in which satisfies A≠∅,B≠∅,C≠∅ and Low(A,B,C)≠∅. Then, we have the following: . If satisfies for every b ∈ V (j = 1,…,n), then . The proof of Theorem 5 can be found in the Appendix.

Remark 19

Example 7 shows and for some . This implies since holds by Example 7 and the definition of for in Example 7. This demonstrates that the converse of Theorem 5(1) is not correct and shows the importance of Theorem 5(2). Theorem 5 implies that the set of semiconcepts in the formal context is characterized by the families of feasible sets of two TP-matroids (U × V × W, and . The two TP-matroids are determined by the lower and upper approximations and , respectively. These facts indicate that studies of TP-matroids and approximation operators will have similar positions in research on knowledge-based fields. They also demonstrate the intimate relationships between matroid theory and rough set theory. A sketch of the process of searching for TP-matroids in formal contexts is shown in Fig. 2.

Fig. 2

Diagram of searching for TP-matroids in formal contexts

In this paper, we present two pairs of operators related to rough set approximations over three sets: and . Next, we will explore the relationships between and , and we aim to determine under what conditions they are the same. Considering Remark 18(3) and Theorems 3, 4 and 5, we can obtain the following corollary.

Corollary 1

Let U = U1 ∪… ∪ U, V, W = {w1,…,w} and be defined as in Definition 12 (j = 1,…,n). Let satisfy A≠∅,B≠∅ and C≠∅. Let be as given in Definition 13, and let be as given in Definition 12. Suppose that is a covering of (U,V,W). If satisfies for every b ∈ V (j = 1,…,n), then the following statements are correct. Let and be the rough set approximations generated by as given in Definition 11. Then, they satisfy . Let be the pair of rough approximations generated by as given in Definition 11. Then, The proof of Corollary 1 can be found in the Appendix. We will use an example to illustrate Corollary 1.

Example 9

Let U1 = {a1,a2},U2 = {a3,a5},V = {b1,b2}, and W = {w1,w2} be given in Example 5. It is clear that (1) for every b ∈ V (j = 1,2) and (2) aided by Example 8, we obtain and b ∈ B satisfy (a3,b1,w2),(a3,b2,w2),(a5,b1,w2)}. Therefore, it follows that . That is, is a covering of (U = U1 ∪ U2,V,W). Let A = {a1,a2,a3},B = {b1,b2} and C = {w1,w2}. Considering Example 8, we may easily obtain . Thus, using Theorem 4, we obtain . Furthermore, considering Definition 11, we confirm that and and X≠∅) or and Z≠∅)}. Using Definition 11, we obtain = = and . Hence, we obtain . That is, item (1) in Corollary 1 is confirmed. By Theorem 4, we obtain . In view of Definition 11, we may easily obtain . Moreover, we arrive at . Considering and the formal context language expression corresponding to (U,V,W) with Example 5, we obtain U = {a1,a2,a3,a5},V = {b1,b2},W = {w1,w2} and the formal context in Table 13 below.

Table 13

Formal context

	b₁	b₂
a₁	1	1
a₂	1	1
a₃	1	1
a₅	1	0

Formal context We may easily show that . By Definition 2(2) and Remark 2, this means that . Therefore, item (2) in Corollary 1 is confirmed.

Remark 20

Let U,V, and W be as given in Definition 12, and let be as given in Definition 13. Let . If there is an a0 ∈ U satisfying for any b ∈ V and every w0 ∈ W, then holds. Therefore, is not a covering of (U,V,W). If a0 ∈ A≠∅ satisfies for any b ∈ V and every w0 ∈ W, then we obtain Low(A,B,C) = Low(A ∖ a0,B,C), and furthermore, . Therefore, we determine that . Hence, is not a precovering TP-matroid. In addition, the above results show that a0 ∈ A≠∅ is not reasonable for the properties of . Therefore, in the assumptions of Corollary 1, we suppose to be a covering of (U,V,W). Let A≠∅,B≠∅, and C≠∅, and let be a covering of (U,V,W). Suppose that every satisfies the given condition as in Corollary 1 (j = 1,…,n). On the one hand, according to Theorem 4, we know that . Considering the proof in Corollary 1, we know that . Let . Then, we have . We obtain since and . Thus, is ⊔-closed by Definition 10. Analogously, we determine to be ⊔-closed according to Theorem 4 and Definition 10. However, from the proof of Corollary 1, we know that Low(A,B,C)≠∅. Taking this result and Lemma 9(3), we conclude that and since . Thus, we determine that (‡1) is not a covering of (U,V,W) since it generally does not satisfy . (‡2) is not a covering of (U,V,W) since it generally does not satisfy . Therefore, and may not be precovering TP-matroids. The above analysis of two cases with Corollary 1 indicates that for a TP-matroid and satisfying A≠∅,B≠∅ and C≠∅, if we assume that results (1) and (2) in Theorem 2 are correct, then we cannot determine to be a precovering TP-matroid. That is, we cannot determine the correctness of the converse proposition of Theorem 2. Hence, we cannot use Theorem 2 in the proof of Corollary 1. This result demonstrates that the two pairs of rough set approximations provided in this paper are different. Each of them has its own distinguishing features. They are two different kinds of generalizations of Pawlak’s classical rough set approximations. (3) Using the analysis in Remark 19(1) and Theorem 5, we believe that in general, and hold since and or Y ∩ B≠∅ or Z ∩ C≠∅}. (†1) is the maximum element in . (†2) , . Corollary 1 demonstrates that if (A,B) is a semiconcept in the formal context , then holds. Corollary 1 also shows the linkage between semiconcepts and the two kinds of rough set approximations provided in this paper. Since the theory of semiconcepts belongs to the research field of formal concept analysis, we use formal concept analysis to build TP-matroids based on a pair of approximation operators. Therefore, the work described at the beginning of this subsection is completed.

Conclusion and future work

This paper provides a new mathematical structure—the TP-matroid. It shows that a TP-matroid is a generalization of a matroid from one set to three sets up to isomorphism. Furthermore, using the structure of the TP-matroid and the covering of a set, we provide a precovering TP-matroid over three sets. To precover TP-matroids over three sets, we search for a pair of rough set approximations in Section 3.2. The method used here is different from already existing methods of establishing rough set approximations with matroidal structures [13, 14, 17, 59, 60], since those methods consider matroidal structures over one set, and our structures are over three sets U,V and W; that is, their structures are in one-dimensional space, and ours are in three-dimensional space. In fact, one set U is a subset of three sets (U,V,W) up to set isomorphism since . Under this idea, we can say that TP-matroids are a generalization of the matroids in [13, 14, 17, 59, 60]. In Section 4, we study some properties of rough set approximations over three sets with respect to formal contexts. All expressions here are different from those in [34-40] since our expressions are in ternary form and theirs are over two universes; our expressions are also different from those in [36] since the model of rough sets in [36] is relation-based and ours is covering-based. However, both the results here and the research results in [34-40] are based on some practical needs and are generalizations of Pawlak’s classical rough set approximations. That is, the research here may be applied in more practical studies, which is one of the goals of this paper. Furthermore, the proposal of TP-matroids enabling some rough set approximations to extract information on three sets with the help of the covering idea is a highlight of the paper. Using a pair of approximation operators aided by formal concept analysis, we build up two TP-matroids. Regarding other pairs of approximation operators such as that used in [34] to build up TP-matroids, we hope that the ideas presented here can assist in exploring these researches. Im et al. [79] discussed a new matroidal structure—the matroid cup game on , or on a kind of n-dimensional space. How can TP-matroids be generalized to an n-dimensional space U1 ×… × U, where U is a set (i = 1,…,n > 3) such that at least one of U is a universe (i = 1,…,n)? We now try to solve this problem as follows. Let satisfy the following conditions: . Let . , where if and only if for (i,j ∈{1,…,n);i is odd, and j is even). Let . Then ⇒∃(y1,…,y) ∈ (Y1,…,Y) ∖ (X1,…,X) = ((Y1 ∖ X1,…,Y ∖ X) satisfies , where (y1,…,y)≠∅. Then, is a matroidal structure, called an n-partial matroid or simply an np-matroid. Comparing the np −matroid with matroid cup game, we find the following: U can be different, but every is the same as . The matroid cup game solves the n-cup game used in practice. What are the practical needs of the np −matroid? How can np −matroids be used to simulate a continuous process such as that of Im et al. [79]? The questions raised in (2) and (3) will be answered in the future. Additionally, in the future, we will consider the following work: (*) It is well known that matroid theory provides a good platform for designing greedy algorithms, which are used widely in practice. How can a greedy algorithm be designed for a TP-matroid? How can this greedy algorithm be used to solve some problems in rough set theory that are NP-hard? (**) Sun and Ma [36] set up a fuzzy rough set model over multiple universes based on relations. How can we establish a covering-based rough set model over multiple universes and explore the relationships between the covering-based rough set model over multiple universes and that in [36]?

3 in total

Rough set approximations based on a matroidal structure over three sets.

Introduction

Some notions and properties

Some notations

Remark 1

Matroid

Definition 1

Formal concept analysis

Definition 2

Lemma 1

Remark 2

Posets and equivalence relations

Definition 3

Definition 4

Rough set

Definition 5

Lemma 2

Definition 6

Definition 7

Definition 8

Rough set approximations produced by a new matroidal structure—TP-matroid

Relationships between TP-matroids and matroids

Definition 9

Remark 3

Example 1

Example 2

Remark 4

Remark 5

Definition 10

Remark 6

Lemma 3

Lemma 4

Remark 7

Theorem 1

Remark 8

Approximations generalized by TP-matroids

Definition 11

Remark 9

Lemma 5

Remark 10

Lemma 6

Remark 11

Example 3

Example 4

Remark 12

Theorem 2

Remark 13

Approximations related to formal contexts

Definition 12

Remark 14

Example 5

Remark 15

Lemma 7

Lemma 8

Example 6

Definition 13

Remark 16

Example 7

Lemma 9

Remark 17

Theorem 3

Example 8

Remark 18

Theorem 4

Theorem 5

Remark 19

Corollary 1

Example 9

Remark 20

Conclusion and future work

1. Rough sets: past, present, and future.

2. On the neutrosophic soft set with rough set theory.

3. Insects as Novel Food: A Consumer Attitude Analysis through the Dominance-Based Rough Set Approach.