Literature DB >> 26819579

Using Self-Organizing Neural Network Map Combined with Ward's Clustering Algorithm for Visualization of Students' Cognitive Structural Models about Aliveness Concept.

Nurettin Yorek1, Ilker Ugulu1, Halil Aydin1.   

Abstract

We propose an approach to clustering and visualization of students' cognitive structural models. We use the self-organizing map (SOM) combined with Ward's clustering to conduct cluster analysis. In the study carried out on 100 subjects, a conceptual understanding test consisting of open-ended questions was used as a data collection tool. The results of analyses indicated that students constructed the aliveness concept by associating it predominantly with human. Motion appeared as the most frequently associated term with the aliveness concept. The results suggest that the aliveness concept has been constructed using anthropocentric and animistic cognitive structures. In the next step, we used the data obtained from the conceptual understanding test for training the SOM. Consequently, we propose a visualization method about cognitive structure of the aliveness concept.

Entities:  

Mesh:

Year:  2015        PMID: 26819579      PMCID: PMC4706862          DOI: 10.1155/2016/2476256

Source DB:  PubMed          Journal:  Comput Intell Neurosci


1. Introduction

Although biology is named as “a science of the living things,” a clear definition of “aliveness” concept cannot be made. Because of scientific uncertainty as well as moral, legal, and theological aspects in this concept, it is difficult to make an exact definition [1]. Therefore, aliveness emerges as one of the most difficult concepts to be explained. Perhaps because of this difficulty, it is seen that there are relatively small numbers of studies examining the concept of aliveness as a main theme in the literature. Studies are generally focused on the classification of living things and mainly animals by the students (examples from other researches on the subject are given below). The studies in the field of cognitive psychology and neuroscience were mainly focused on the studies consisting of category-specific knowledge deficit or category-specific impairment about the living things (e.g., [2-8]) and the mental representation of living/nonliving distinction with the examination of the brain with modern imaging methods [9-11]. Bardel [12] revealed some conceptual models about the aliveness concept. Two of them are listed as animistic and vitalist models. In animistic model, the concepts of aliveness and motion are discussed. It is explained that motion is not only intrinsic to the living things by looking at the fact that vehicles such as cars and planes move. Also according to this model, it was stated that animals would be seen as in the forefront in terms of aliveness compared to plants, and as a result of this, it was emphasized that “animistic misconceptions” would occur. In addition to this definition of Bardel [12], it was seen that there were many studies containing the aliveness identified with motion especially in early childhood (e.g., [13-18]). Besides, there are some studies indicating the existence of animistic thinking about living things also in many elderly people [19]. Also in studies carried out in different age groups, it was emphasized that motion lays behind the fact that the subjects were interested in animals rather than plants [20, 21]. Babai et al. [22] argued that aliveness was entirely associated with motion (movement = alive) as a naive and intuitive concept, and this was riveted by protecting its existence until adulthood. Dellantonio et al. [23] suggested that categorization as an animate/inanimate was more fundamental according to distinction such as living/nonliving or biological/nonbiological. According to this distinction, concept of motion was associated with aliveness. In the same study, it was argued that human was in a different position in terms of “vitality” and was shown in a separate category. As a result, it was stated that the perception of aliveness followed up a row in the form of “human-animal-plant” [24]. In their research about aliveness, Caravita and Falchetti [25] addressed the question “Is a piece of bone taken from a live body also alive outside the body?” to the subjects. Most of the students answered this question as “yes,” and they mostly showed the style marks of motion, structure, function, and the bone as evidence. As a result of an interesting research focused on the genetic basis of living/nonliving distinction, it was reported that aliveness concept was an innate category organized in semantic memory [26]. In his researches examining the classification of animals by the students, Braund [27, 28] expressed that the alternative concepts, misconceptions, naive theories, and some preconcepts developed against scientific concepts by the students were clear barriers against the teaching of scientific subjects. In his researches stating that the students were mostly interested in vertebrates, Braund [27, 28] revealed that 12-year-olds perceived the turtle like a slug and grouped it as an invertebrate animal. Some of the 12- and 15-year-old students described the penguin as bird, some of them described it as fish, and some of them described it as mammal. This situation showed that students took into account the external appearances of the living things while reconstructing the aliveness concept as well as motion [16]. In his study investigating the “personal taxonomic criteria” used by the students while classifying animals, Kattmann [29] suggested that students mostly used nonscientific classification criteria (swimmers, four-legged, two-legged, flying, etc.). Also, he stated that nontaxonomic criteria were at higher rate than the taxonomic criteria used. Kattmann [29] stated that students mostly took into account the external appearances rather than biological classification while classifying animals, and they performed grouping according to the motion forms (flying, swimming, and so on) that they used for the purposes of the place they lived in and change of location. Moreover, he stated that this method used by the students to classify the animals remained unchanged even though they were taught biological classification. Similar results were reported also in the studies carried out on Turkish students [30]. Some researches examining the thoughts of the students about plants are found in the literature even though they are few in number. In one of these, it was stated that students mostly evaluated the plants by their anatomic properties and external appearances while grouping them (Tunnicliffe & Reis, 2000). Wandersee and Schussler [31, 32], as botanist and biology educators, investigated the reasons why people showed interest in animals rather than plants. As a result of their studies, they formulated the plant blindness theory as a new concept. Wandersee and Schussler [31, 32] described this theory as follows: incomprehension of the value that plants had for the atmosphere and the human life, inability to appreciate the aesthetic and biological properties of the plant kingdom, and, consequently, coming to the conclusion that animals were more valuable than plants in terms of benefit to humans with an anthropocentric perspective. There are numerous studies related to artificial intelligence approach in which conceptual modeling is “generally” evaluated (some of the leading ones, [33-36]). Besides, there are also studies about cognitive structuring modeling of concepts in relation to living creatures. Among these, we see that studies in which texts that include animal names (mostly simple sentences) are analyzed by using SOM are much more common [37, 38]. Furthermore, Ritter and Kohonen [37] have formulated the self-organizing symbol map by using logical similarities of some animals (duck, dog, cat, lion, etc.) and their characteristics (being small, being big, having 2 legs, having 4 legs, liking to fly, liking to swim, etc.). Although it is suggested that anthropocentric and animistic misconceptions are effective in cognitive structuring of aliveness concept, it is not that easy to explicitly visualize and expose that structuring. Yorek et al. [24] have questioned whether there can be a distinction among living creatures due to aliveness based on “animistic-anthropocentric model” they have developed. In another study, which uses the same model, subject was evaluated with fuzzy and rough set approaches and a mathematical model was suggested [39]. We initiated this study with the question “How can we use SOMs in analyzing and visualizing cognitive structure of aliveness concept?” with the motivation of the idea of so called mathematical approach. How we gathered data and how we realized the analysis are explained in the following chapters of this study.

2. Artificial Neural Networks

Artificial neural networks (ANNs) are mathematical models inspired by biological neural networks contained in human brain. Having similar characteristics to those of biological neural networks (i.e., consistency, flexibility, parallel function, tolerance to errors, etc.), these systems attempt to learn tasks and determine how they will react to new tasks by means of creating their own experiences through the data obtained by using the predetermined samples [40]. Neural networks can be used to model complex relationship without using simplifying assumptions, which are commonly used in linear approaches [41]. The other advantages of the ANNs are the ability to represent both linear and nonlinear relationships, the ability to learn these relationships directly from the data used, not needing to take into account detailed information of structures and interactions in the systems, and the fact that they are regarded as ultimate black-box models. At least in some cases if not always, that is, for prediction using the trained network, the ANN systems are alternative to experimentation and save a lot of time which may have been consumed since experimentation is so difficult and in some cases is impossible. Artificial neurons based on biological neurons were first defined by McCulloch and Pitts [42]. McCulloch-Pitts (MCP) neuron model is given in Figure 1.
Figure 1

Representation of an artificial neuron.

In all neural network models, x input values are multiplied by w connection weights and then summed up. Summation unit is compatible with the body of biological neuron. It sums up weighted inputs and then gives the net output (y = f(net)).

2.1. Self-Organizing Maps

A self-organizing map (SOM, also known as Kohonen map) is a type of the artificial neural algorithm and is based on unsupervised learning. The structure of SOMs is composed of two layers fully attached to each other: input layer and Kohonen layer [43]. Kohonen layer is also the layer where the map is formed that will ensure the observation of clustering in the data set. Hidden layers are not in question as in prediction or classification studies. Neurons in the Kohonen layer are generally arranged two-dimensionally. The number of neurons in the input layer is equal to the number of variables used. Each neuron in the input layer is connected to each neuron in Kohonen layer as feed forward. Inputs can be calculated by the following for the Kohonen layer: w in (1) is the weight of the connection outgoing from the input neuron represented by i to the neuron represented by j in Kohonen layer. The weight vectors are collectively known as the SOM's memory [44]. d represents the number of variables. When considering the winner-take-all paradigm, the neuron taking the highest y value in Kohonen layer will be the winner neuron. The SOM algorithm firstly assigns small random values to the connections between input layer and the Kohonen layer. Then, the algorithm undergoes three essential processes. These are competition, cooperation, and adaptation [45]. Competition Process. A random observation (student) is selected from the data set. This observation can be expressed as x = (x 1, x 2,…, x ). In Kohonen layer, the expression of weights of the neuron j is possible as follows: The expression of d in (2) represents the total number of neurons in the Kohonen layer. In order to find the best match of weight vectors w , w ,…, w with x input observation, w 1 x, w 2 x,…, w x scalar products are calculated and the largest one is selected. The criteria of finding the best match based on the selection of the largest one of the scalar products are equal to the mathematical maximization of Euclid distance between w vector and x. Therefore, the index of the winning neurons for observation is calculated as follows:For each input in the competition process, neurons in the model are in competition with each other. Cooperation Process. In the cooperation process, a topological neighborhood is determined, and the cooperating neurons will settle according to the topological neighborhood such that the winner neuron will be at the center. The winner neuron determines the topological value of the neurons affected by competition; therefore, cooperation is ensured between nodes. Adaptation (Synaptic Compatibility). Neurons affected by competition arrange their synaptic weights according to the example. The new weight vector in (s + 1) cycle of neuron j is calculated as follows:Here, η(s) is the learning parameter and h (s) is the neighborhood function. Neurons are connected to each other by neighborhood relation. This neighborhood relation determines the structure or topology of the map. Figure 2 represents a simple SOM. Kohonen layer consisting of 9 × 7 neurons appears in the figure. The input vector is represented by x and there are variables in this vector (it is expressed that n number of variables exist in the expression of x ). The weight vector is represented by w and it contains the weights outgoing from each variable to the each neuron. The yellow-colored neuron represents the winner neuron and the surrounding neurons are its neighbors.
Figure 2

SOM input and output layers.

2.2. The Ward Clustering in General

The Ward classic clustering method is a hierarchical agglomerative cluster algorithm. Clustering process is initiated by accepting each node as a separate cluster. Then, at each stage of the algorithm, the clusters with minimum distances between themselves (according to the distance measure defined by a specific algorithm) are combined in pairs. This smallest distance is called the Ward distance and defined as follows:Here, r and s represent the two distinct clusters, n and n represent the data points of two clusters, x and x represent the center of gravity of the clusters, and ‖·‖ is Euclidean norm. Starting from the full distance matrix (lower triangle matrix as the distance measure is commutative), a row and a column are removed in each step until the matrix is completely cleared and only one cluster will remain (and a different row and column are updated). The mean and cardinality of the new cluster built as a product of the combination phase are calculated as follows:

2.3. The SOM-Ward Clustering

The two main ways to cluster data are hierarchical and partitive approaches. The hierarchical methods can be further divided into agglomerative and divisive algorithms, corresponding to bottom-up and top-down strategies, to build a hierarchical clustering tree. Of these, agglomerative algorithms are more commonly used than the divisive methods [46]. The SOM-Ward clustering approach is a two-level clustering approach that uses Ward's clustering algorithm to determine the SOM and clustering results. The Ward clustering algorithm is an agglomerative hierarchical clustering method [18, 47]. Agglomerative clustering algorithms usually have the following steps [46]: Initialize: assign each vector to its own cluster. Compute distances between all clusters. Merge the two clusters that are the closest to each other. Return to step (2) until there is only one cluster left. In the SOM-Ward clustering approach, process begins by accepting each node as a separate cluster. Until one cluster will remain on the map, clusters with minimum Euclidean distance between themselves are combined with each other in pairs. While determining the distance, both the Ward distance and the topological properties of SOM are taken into account. In other words, the distance between two nonassociated clusters is considered as infinite and only the associated clusters are combined. Low SOM-Ward distance value represents a more natural clustering for the map, and high value represents an artificial clustering for the map. By this means, users can select the optimal cluster number in a flexible manner.

2.4. Cognitive Structural Modeling and SOMs

Although using of SOM is recommended in conceptual modeling studies [34, 35], it is suggested that SOM modeling in executive functions of brain such as reasoning and language faces some difficulties. Since self-organized maps reflect simple distance relations among input vectors, they mostly characterize lower levels of perception. For high level processing, discrete symbols are needed. Maps formed by these symbols in brain are composed of logically related symbols coating neighboring areas [37]. Similarly, Gärdenfors [33, 48], while explaining representing information on conceptual level in his conceptual spaces theory, associates quality dimensions with similarity and distance concepts [34, 49, 50]. When the neural adaptation law (4) is applied to a vector variables set, a topographic map is being obtained that shows logical distance among symbols. But logical relatedness between different symbols cannot be directly determined by their encodings. At least in learning process, symbols should be regularly presented due to their attribute values. From these, we can come up with the conclusion that symbols with similar characters are represented close to each other on map [37]. Results obtained from text analysis prove that SOMs can be safely used in high dimensional data analysis such as independent component analysis (ICA), principal component analysis (PCA), and singular value decomposition (SVD) [35, 38, 51]. In these studies, SOMs can be used single, while they can also be combined with different models (Bayesian, etc.) [36]. This study reports on the discussion of how to utilize the SOMs in the modeling and visualization of the cognitive construction of aliveness concept. In this regard, the questions that we searched for an answer to are as follows: Which living things are primarily associated with the aliveness concept by the subjects? and How can we use SOMs in modeling and visualization of cognitive construction of aliveness concept?

3. Method

Qualitative data obtained from student answers to open-ended questions were used to create SOM-Ward model.

3.1. Subjects

The participants included 100 students (55 female and 45 male) who were selected via cluster sampling method from nine high schools in Izmir, a large city in western Turkey. Schools accepted students from different parts of the city and students varied in terms of socioeconomic status.

3.2. Data Collection

In this study, a conceptual understanding test was used. The test included open-ended questions and was developed by researchers. In addition, to clarify vague concepts and to obtain in-depth information about the topics, interviews were conducted with students and teachers. The final version of the test used in this study is presented in Table 1.
Table 1

Conceptual understanding test.

QuestionsAim
1 Write the names of the first 10 living things coming to your mind.Uncovering which living thing is primarily associated with the aliveness concept.

2It is estimated that there are millions of species living on Earth. If you were asked to classify all the living things (types, species) into main groups, without leaving anyone out, at least how many groups could you form?Uncovering which properties of the living things are paid attentions by the students while dividing them into groups, how they created their own taxonomic groups, how they expressed this conceptually, or how they called the groups.

3When considering the all living things, what is the place (position) of human? Please explain.Clarifying the reasons why the students evaluated the human in a separate category while grouping the living things in the nature based on their own statements.

4. Results

4.1. Conceptual Understanding Test Results

Question 1 .

When analyzing all of 10 living things written by the students, those most frequently repeated were, respectively, human, dog, cat, mouse, and rabbit. It was observed that 107 (74.31%) of 144 different living things written were animals, 26 (18.05%) of them were plants, and the remaining 11 (7.64%) were other living things. Those that were most frequently written in animals (51.43%) were the mammals. It was observed that it was the human which was mostly written to the first rank among 10 living things. When looking at the answers of the students who wrote at least one plant name among the answers, it was observed that they wrote plant to the 6th rank among 10 living things. According to this, it could be said that students structured the “aliveness concept” by associating with the animals, particularly with human. In this structuring, it was believed that plants and other living things came after human and animals in terms of aliveness.

Question 2 .

When analyzing the answers of the students to the second question about the separation of living things (types of living things) into certain groups, about one-third of the students (35%) created the groups consisting of “animal, plant, and human.” The most interesting result of the answers given to this question was that of 24% of students; in other words, one out of every four students created just “animal” groups during grouping.

Question 3 .

When analyzing the answers of the students to the third question about the position of human within all living things, 78% of students emphasized the human's ability of being intelligent and of thinking. Starting from this point, they defined the position of human as the “most excellent,” “most sophisticated,” and thus “the most supreme being.”

4.2. Creating the SOM-Ward Model

4.2.1. Data Preprocessing for Training of the SOM-Ward Model

For the training of the SOM-Ward model, the answers given to the first question were taken into account by the students' conceptual understanding test. The data obtained from the other questions were used in the verification of the answers given to the first question and in the interpretation of the SOM-Ward model. Firstly, 144 living things written by the students were collected under 10 groups (Table 2). While these groups were determined, animals were mostly included as students wrote the name of animal at the most. Then, a code was defined for each group (Table 3).
Table 2

Grouping of living things written by the students (as it is written).

12345
EuglenaDaisyBlackberryParasite(s)Sea cucumberFish(es)Frog
BacteriumPlant(s)LotusEarthwormTickShark
AmoebaTree(s)JasmineSpiderSnailStingray
MushroomRoseMistletoeBeetle(s)Hydra
AlgaeFlower(s)PurslaneButterflyInvertebrate
ParameciumCactusDaffodilBeeJellyfish
MoneraGrassKiwiGrasshopperCalamari
ProtistsFernSnowdropScorpionMussel
PlasmodiumTulipWillowFlyMantis
PlanktonChrysanthemumGeraniumAntSponge
PlanariaSunflowerBegoniaCaterpillarCockroach
HyacinthVioletCancerLadybug
MillipedeOctopus

678910

SnakeBird(s)LionSquirrelDogHuman, human names
CrocodileEagleBearPandaCat
LizardBatTigerMonk sealMouse
TurtlePenguinMonkeyLeopardRabbit
ReptileParrotGiraffeWeaselCow
ChameleonFalconElephantPantherHorse
IguanaPigeonWhaleVertebrateDonkey
DragonDuckWolfTapirSheep
Cock, RoosterFoxGazelleOx
Chicken, ChickGorillaCougarCamel
OwlDolphinSkunkGoat
PartridgeKangarooCheetahMammal(s)
SparrowKoalaRhinocerosCalf
HawkHippopotamusOtterLamb
CanaryZebraAnteaterBuffalo
OstrichHedgehogHyenaLama
VultureAnimal(s)PorkMule
LynxBull
Table 3

Groups' definitions (according to biological classification).

CodeGroupsExplanation
1Prokaryotes, protists, and fungusesBacterium, algae, amoeba, unicellular eukaryotes, Mushrooms, and so forth
2PlantsFlowering and nonflowering plants
3AnimalsAll invertebrates, insects, mollusks, and so forth
4AnimalsFishes
5AnimalsAmphibians
6AnimalsReptiles
7AnimalsBirds
8AnimalsWild mammals
9AnimalsDomestic mammals and pets
10AnimalsHuman
In the next step, living things written by each student (S1, S2,…, S100) were coded and tabulated by preserving their ranks (R1, R2,…, R10). An example of students' responses is seen in Tables 4 and 5, respectively.
Table 4

An example of students' responses (first question of conceptual understanding test).

StudentsLiving things written by the students
R1 R2 R3 R4 R5 R6 R7 R8 R9 R10
S46 HumanMonkeyDonkeyBirdCowCamelFrogTurtleMouseCat
S47 CatDogTreeEagleDolphinPigeonLionWhaleDaisySnowdrop
S48 LionHumanPlantPandaSnakeDolphinZebraPenguinBearWolf
S49 HumanPlantDogAlgaeBacteriumMushroomBirdLionLeopardCat
S50 DolphinDogHumanDaisyRoseOxBearWhaleMonkeyChrysanthemum
Table 5

Coding of students' responses.

StudentsCoded data of living things
R1 R2 R3 R4 R5 R6 R7 R8 R9 R10
S46 10897995699
S47 9927878822
S48 81028688788
S49 10291117889
S50 89102298882
In the next phase, standardization process was performed because the data set presented in Table 5 is categorical. However, there was no difference among these values as these values (the numbers from 1 to 10) symbolized the groups. In case of using these data as such, analysis results will be incorrect as “1” will bear the meaning of “the lowest” and “10” will bear the meaning of “the highest.” In this respect, a new arrangement was made for figures to denote the same meaning for the variables. A new table was created by calculating on which ranks the students wrote the living things on average. In this case, the living thing with low average rank would be more important in terms of “aliveness” compared to the living thing with high average rank. And this value is of the same meaning for each living thing group. We determined the average rank of the living things. The average rank was determined as follows: for a group of living things we found, from one to 10, at which ranks (R1, R2,…, R10) they were listed. Then, for each line, these values were added together and then divided by the total frequency, for example, code 9 for the student S1 in Table 5; this code was listed five times on the 3rd, 5th, 6th, 9th, and 10th ranks. Accordingly, the average rank was calculated as (3 + 5 + 6 + 9 + 10)/5 = 6.60. The following is an example which was formed according to Table 5 in this way (Table 6).
Table 6

A sample of average ranking of the living things.

StudentsMonera, protists, andfungusesPlantsInvertebratesFishesAmphibiansReptilesBirdsWild mammalsDomestic mammalsHuman
x 1 x 2 x 3 x 4 x 5 x 6 x 7 x 8 x 9 x 10
S46 0.000.000.000.007.008.004.002.006.601.00
S47 0.007.330.000.000.000.005.006.671.500.00
S48 0.003.000.000.000.005.008.006.170.002.00
S49 5.002.000.000.000.000.007.008.506.501.00
S50 0.006.330.000.000.000.000.006.254.003.00
The data in Table 6 are still not suitable for training the SOM. Because if “human” is mentioned as the first object, that is, as the most prominent example of “living,” “human” gets a value of 1 according to the rank in the list (plus an averaging over objects belonging to the same biological category, which is fine). However, this leads to the emergence of a problem. If any entity belonging to biological category “fish” is not mentioned at all by a student in the 10 first alive objects list, it gets a value 0.00 (see, e.g., first row of Table 6). This means that, in this metric of similarity, “no mention” is the closest to “mentioned as the first thing in the list.” Of course, “no mention” should be most similar to “mentioned as the last thing in the list.” If something is not mentioned by a subject, it is not a very prototypical sample of “alive” and therefore should have a low value of “aliveness.” All the data in Table 6 were subtracted from 11 for eliminating the problems; that is, if “human” is mentioned as first thing in the list of 10 objects, it gets “aliveness” value of 11 − 1 = 10. If it is the second, it gets value of 9. And so on, the 10th object gets value of 1. Then, something that is not mentioned at all, such as the “fish” for student S46, can then get value of 0. Accordingly, Table 7 was created.
Table 7

Inputs for training the SOM-Ward Model.

StudentsMonera, protists, andfungusesPlantsInvertebratesFishesAmphibiansReptilesBirdsWild mammalsDomestic mammalsHuman
x 1 x 2 x 3 x 4 x 5 x 6 x 7 x 8 x 9 x 10
S46 0.000.000.000.004.003.007.009.005.6010.00
S47 0.004.330.000.000.000.006.005.679.500.00
S48 0.003.000.000.000.006.003.005.170.009.00
S49 6.009.000.000.000.000.004.003.505.5010.00
S50 0.005.330.000.000.000.000.005.257.008.00
The values exemplified in Table 7 were calculated for each student. This 10-dim data (x 1, x 2,…, x 10) were used in the training of the SOM-Ward model (Table 9).
Table 9

Inputs data for training the SOM-Ward Model.

StudentsMonera, protists, andfungusesPlantsInvertebratesFishesAmphibiansReptilesBirdsWild mammalsDomestic mammalsHuman
x 1 x 2 x 3 x 4 x 5 x 6 x 7 x 8 x 9 x 10
S1 00000384.679.55
S2 03.5000003.597
S3 000006.339740
S4 002.590035.6010
S5 0000000005.5
S6 65300001.58.54
S7 00300005.256.20
S8 9.50000023.7558
S9 000005.3381.570
S10 000004.5082.50
S11 0000001206.5
S12 0231004916.676
S13 00010004.57.20
S14 000943104.337.50
S15 00890063710
S16 05890270010
S17 04.3365000007
S18 005.3300408.54.50
S19 06500003.757.57
S20 3.505.500084.5010
S21 00000035.67.671
S22 04000204.756.6710
S23 001.5000476.338
S24 110900027.553
S25 0805002.562.510
S26 71.5103084600
S27 00000304.676.2510
S28 7000348210
S29 5.53000109800
S30 0403000781.5
S31 67300007010
S32 0070009.5804
S33 64.51009.5034.58
S34 05.50000705.290
S35 05.50700836.510
S36 0000000055.56
S37 068003426.670
S38 01.5000506.337.670
S39 0700001042.50
S40 1.500005.5094.679
S41 6.25900000708
S42 75.530095000
S43 01010027.5663
S44 009003104.55.52
S45 005.5002.57862
S46 000043794.410
S47 03.67000064.339.50
S48 08000634.8309
S49 69000042.54.510
S50 04.67000004.7578
S51 06.5000047.3340
S52 02005005.678.510
S53 004.60701059
S54 002.5207.538.3350
S55 05.334.33000605.59
S56 0900002.33369
S57 09000144.33610
S58 000746102.57.331
S59 006271849.50
S60 005.540182.59.57
S61 001005.506.334.6710
S62 0510204307.674
S63 002.560083.338.670
S64 10720800590
S65 1.55.7540000097
S66 0200004787
S67 044026.560100
S68 05.5010405.5710
S69 0404037095.5
S70 070904.53.5072
S71 0001006.51.5590
S72 00080266.529
S73 00200837.42.50
S74 071.5000648.510
S75 04.250003608.334
S76 00050265.47.50
S77 002.591007.56.250
S78 100109655.673.50
S79 01.53.5500707.6710
S80 12003004.57.510
S81 4.51200669.500
S82 07200814.5610
S83 08000005.54.675.67
S84 079003.501.57.250
S85 00000037.331.55
S86 8.3431000057.50
S87 90820005110
S88 6401006590
S89 75.33602035100
S90 603.33003578.50
S91 071080293.3354
S92 096.677016.5302
S93 5.60300064.590
S94 70800166.2503
S95 691001072.585
S96 5.25840530600
S97 60060001.5103
S98 026.500526910
S99 5.58060501.59.53
S100 01.534006.56.5910

4.3. The SOM-Ward Model Results

For clustering and visualization of cognitive construction of aliveness concept, the SOM-Ward model was trained by batch training algorithm. Matlab and Viscovery SOMine software were used in the creation of SOM [52].

4.3.1. Clustering and Visualization of the Clusters

The SOM cannot fully give detailed information about the clusters. In the Viscovery SOMine program, hierarchical clustering algorithm and the SOM are used together, and the interpretation of the resulting maps will be easier. This clustering algorithm operates as follows: firstly, each student (observation) is divided into a separate cluster. In this way, the number of clusters will be equal to the number of observations. Then, in each step, observations which are the closest to each other are combined according to the SOM-Ward distance measure. In this special distance unit, both the distance of the two clusters and the location on the map are taken into account. All 10 variables were used to reveal the clusters in the data set. To what extent will the neurons behave competitively with one another in the training phase is determined by the tension parameter of the map. The smaller this value is, the more the map tries to harmonize itself to the data set. This value varies between 0.3 and 2 and it was determined as 0.5 in this study. 1000 neurons were used in Kohonen layer and the emerging clusters are shown in Figure 3.
Figure 3

SOM-Ward clusters after training.

U-matrix emerged after the formation of Ward agglomerative hierarchical clustering and SOM together is shown in Figure 4. Number of clusters will be determined with the help of this matrix. Light colors in this map represented that there were more students; therefore, there was a clustering. Dark colors in this map showed that there was not an observation in that part of the map. Therefore, dark colors on the map determined the cluster boundaries.
Figure 4

U-matrix.

The number of clusters can be determined by looking at U-matrix; however, there may be some personal judgments as people's perceptions will come to the forefront. In the literature, different methods in determining the number of clusters are recommended using U-matrix [53, 54]. Depending on our training data (Table 7), Kohonen layer is divided into three clusters. Related to these clusters, observation percentage in clusters and the percentage of each variable present in the related cluster are given in Table 8.
Table 8

Clusters and descriptive statistics.

Groups of the living thingsClusters
C1C2C3
39%32%29%
Prokaryotes, protists, and funguses0.410.204.81
Plants3.592.812.75
Invertebrates1.812.433.06
Fishes0.544.780.48
Amphibians0.490.840.62
Reptiles1.622.323.08
Birds1.736.503.50
Wild mammals4.123.365.62
Domestic mammals and pets5.766.334.06
Human7.153.302.34
Generally, in clustering studies, it is seen that each cluster is named. Such a naming study is usually performed by the researcher. Clusters were named also in this study. Distinguishing characteristics of clusters can be expressed as follows. C1-Anthropocentric Cognitive Structure. 39% of students in the data set taken into account in this study are in this segment. It is seen that human, mammals, and plants stand out. C2-Primary Animistic Cognitive Structure. 32% of the data sets are in this segment. Domestic and/or familiar animals stand out, for example, fish, birds, and domestic mammals. C3-Secondary Animistic Cognitive Structure. 29% of students are clustered in this segment. Wild and/or unfamiliar animals, for example, wild mammals, reptiles, and invertebrates, prokaryotes, protists, and funguses stand out. It is highly apparent that the spatial order of the responses has captured the essential “family relationships” among the living things [37]. Clusters responding to, for example, “human,” occupy the right part of the lattice, clusters responding to “familiar animals” such as “birds,” “domestic mammals,” and “pets” gather towards the left, and clusters responding to more “wild” species such as “reptiles,” “wild mammals,” and “invertebrates” aggregate in the lower middle. Within each cluster, a further grouping according to similarity is discernible [37]. The component planes of the map are shown in Figure 5.
Figure 5

The component planes of the map.

The resulting SOM-Ward model consists of three clusters. The component planes (Figure 5) shows the distributions of each variable (groups of the living things) across the map, on which the color scale visualizes the distribution of each variable over different clusters. Warm colors indicate more significant for construction of aliveness concept, while cold ones indicate the reverse; for example, fishes were mainly found in cluster C2 and human in C1.

5. Conclusion

Cluster analysis is a dimension reduction method. In this study, cluster analysis was applied to the data set obtained from the students with regard to the aliveness concept with the help of SOM. 10 variables of 100 students were expressed with the help of two-dimensional maps. As a result, 3 clusters which we called C1, C2, and C3 emerged with regard to the cognitive construction of aliveness concept. Properties of these clusters and the emergence of different colors according to the input variables are thought to be an important step for us to “be able to understand” the aliveness concept. According to our literature survey, it was observed that the cognitive construction of aliveness concept was closely associated with anthropocentrism and animism [13–19, 22, 24, 31, 32, 39, 55]. The results obtained from this study appeared to be consistent with the literature. As an additional property, the visualization of the results by using SOM enabled the clusters to form the cognitive structure and the characteristics of this cluster to be seen clearly. Another issue is the optimization of the structures of the SOMs. For instance, different structures can be obtained by changing the number of neurons in the output level. Similarly, different maps will emerge when learning parameter is changed. Another issue is the fact that the results of the analysis will actually show deviation when irrelevant variables are used. Different methods have been developed to optimize the structures of the SOMs. This classification process performed with the help of SOMs can be renewed by a different data set. For instance, there might be some variables that might increase the performance of the clustering analysis when they are removed from the analysis. The determination and removal of such variables with other methods may ensure analysis to have higher performance. For this, primarily, determining a reliable performance criterion is required. Recent analysis that became possible by increasing the processing capacity of the computers can be applied for modeling in the field of cognition and neuroscience. It can be expected that the SOMs can also be used to solve different problems in this field due to the power of visualization they have.
  18 in total

1.  The selective impairment of fruit and vegetable knowledge:amultiple processing channels account of fine-grain category specificity.

Authors:  Sebastian J Crutch; Elizabeth K Warrington
Journal:  Cogn Neuropsychol       Date:  2003-05-01       Impact factor: 2.468

2.  The fur of the crocodile and the mooing sheep: A study of a patient with a category-specific impairment for biological things.

Authors:  Regine Kolinsky; Patrick Fery; Diana Messina; Isabelle Peretz; Sylvie Evinck; Paulo Ventura; Jose Morais
Journal:  Cogn Neuropsychol       Date:  2002-06-01       Impact factor: 2.468

3.  Genetic and environmental influences on the organisation of semantic memory in the brain:is "living things" an innate category?

Authors:  Martha J Farah; Carol Rabinowitz
Journal:  Cogn Neuropsychol       Date:  2003-05-01       Impact factor: 2.468

4.  Visual and semantic processing of living things and artifacts: an FMRI study.

Authors:  Gian Daniele Zannino; Ivana Buccione; Roberta Perri; Emiliano Macaluso; Emanuele Lo Gerfo; Carlo Caltagirone; Giovanni A Carlesimo
Journal:  J Cogn Neurosci       Date:  2010-03       Impact factor: 3.225

5.  Category-specific knowledge deficit for animals in a patient with herpes simplex encephalitis.

Authors:  Carlo Blundo; Monica Ricci; Laurie Miller
Journal:  Cogn Neuropsychol       Date:  2006-12-01       Impact factor: 2.468

6.  A logical calculus of the ideas immanent in nervous activity. 1943.

Authors:  W S McCulloch; W Pitts
Journal:  Bull Math Biol       Date:  1990       Impact factor: 1.758

7.  Reflectivity, internality, and animistic thinking.

Authors:  M D Berzonsky
Journal:  Child Dev       Date:  1974-09

8.  A case series analysis of "category-specific" deficits of living things:the hit account.

Authors:  Glyn W Humphreys; M Jane Riddoch
Journal:  Cogn Neuropsychol       Date:  2003-05-01       Impact factor: 2.468

9.  Child animism; what the child means by alive.

Authors:  S W KLINGENSMITH
Journal:  Child Dev       Date:  1953-03

10.  Multistrategy self-organizing map learning for classification problems.

Authors:  S Hasan; S M Shamsuddin
Journal:  Comput Intell Neurosci       Date:  2011-08-16
View more
  4 in total

1.  On the Use of Self-Organizing Map for Text Clustering in Engineering Change Process Analysis: A Case Study.

Authors:  Massimo Pacella; Antonio Grieco; Marzia Blaco
Journal:  Comput Intell Neurosci       Date:  2016-12-04

2.  A Novel Classification and Identification Scheme of Emitter Signals Based on Ward's Clustering and Probabilistic Neural Networks with Correlation Analysis.

Authors:  Xiaofeng Liao; Bo Li; Bo Yang
Journal:  Comput Intell Neurosci       Date:  2018-11-05

3.  Self-Organising Map Based Framework for Investigating Accounts Suspected of Money Laundering.

Authors:  Abdallah Alshantti; Adil Rasheed
Journal:  Front Artif Intell       Date:  2021-12-14

4.  Continuous Psychological Nursing Based on Grey Clustering Algorithm in Patients after Transurethral Resection of Prostate.

Authors:  Peiting Lu; Cuiyun Wu
Journal:  Comput Math Methods Med       Date:  2022-07-28       Impact factor: 2.809

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.