| Literature DB >> 34658654 |
Kingsley Okoye1, Arturo Arrona-Palacios1, Claudia Camacho-Zuñiga2, Joaquín Alejandro Guerra Achem3, Jose Escamilla4, Samira Hosseini1,5.
Abstract
Recent trends in educational technology have led to emergence of methods such as teaching analytics (TA) in understanding and management of the teaching-learning processes. Didactically, teaching analytics is one of the promising and emerging methods within the Education domain that have proved to be useful, towards scholastic ways to make use of substantial pieces of evidence drawn from educational data to improve the teaching-learning processes and quality of performance. For this purpose, this study proposed an educational process and data mining plus machine learning (EPDM + ML) model applied to contextually analyze the teachers' performances and recommendations based on data derived from students' evaluation of teaching (SET). The EPDM + ML model was designed and implemented based on amalgamation of the Text mining and Machine learning technologies that builds on the descriptive decision theory, which studies the rationality behind decisions the learners are disposed to make based on the textual data quantification and statistical analysis. To this effect, the study determines pedagogical factors that influences the students' recommendations for their teachers, what role the sentiment and emotions expressed by the students in the SET play in the way they evaluate the teachers by taking into account the gender of the teachers. This includes how to automatically predict what a student's recommendation for the teachers may be based on information about the students' gender, average sentiment, and emotional valence they have shown in the SET. Practically, we applied the Text mining technique to extract the different sentiments and emotions (intensities of the comments) expressed by the students in the SET, and then utilized the quantified data (average sentiment and emotional valence) to conduct an analysis of covariance and Kruskal Wallis Test to determine the influential factors, as well as, how the students' recommendation for the teachers differ by considering the gender constructs, respectively. While a large proportion of the comments that we analyzed (n = 85,378) was classified to be neutral and predominantly interpreted to be positive in nature considering the sentiments (76.4%), and emotional valence (88.2%) expressed by the students. The results of our analysis shows that for the students' comments which contain some kind of positive or negative sentiment (23.6%) and emotional valence (11.8%); that females students recommended the teachers taking into account the sentiments (p = .000). While the males appear to be slightly borderline in terms of emotions (p = .056) and sentiment (p = .077). Also, the EPDM + ML model showed to be a good predictor and efficient method in determining what the students' recommendation scores for the teachers would be, going by the high and acceptable values of the precision (1.00), recall (1.00), specificity (1.00), accuracy (1.00), F1-score (1.00) and zero error-rate (0.00) which we validated using the k-fold cross-validation method, with 63.6% of optimal k-values observed. In theory, we note that not only does the proposed method (EPDM + ML) proves to be useful towards effective analysis of SET and its implications within the educational domain. But can be utilized to determine prominent factors that influences the students' evaluation and recommendation of the teachers, as well as helps provide solutions to the ever-increasingly need to advance and support the teaching-learning processes and/or students' learning experiences in a rapidly changing educational environment or ecosystem.Entities:
Keywords: Educational innovation; Higher education; Machine learning; Performance assessment; Teaching analytics; Text mining
Year: 2021 PMID: 34658654 PMCID: PMC8503388 DOI: 10.1007/s10639-021-10751-5
Source DB: PubMed Journal: Educ Inf Technol (Dordr) ISSN: 1360-2357
Fig. 1Educational process and data mining plus machine learning (EPDM + ML) model
Word count and average sentiment scores for the students’ comments (SET) broken down by gender
| Gender | Element id | Word count | ave_sentm |
|---|---|---|---|
| 1: | 32 | 0.000 | |
| 2: | 2 | 0.000 | |
| 3: | 2 | 0.000 | |
| 4: | 12 | 0.216 | |
| 5: | 3 | 0.000 | |
| … | … | … | |
| 45290: | 3 | 0.000 | |
| 45291: | 4 | 0.000 | |
| 45292: | 24 | 0.000 | |
| 45293: | 3 | 0.000 | |
| 45294: | 3 | 0.000 |
Note: values represent the first and last five comments in the SET data
Summary of the sentiment scores across the SET data considering the gender of the students, broken down by word count, standard deviation, and the sentiment score
| Male students | Female students | |||||
|---|---|---|---|---|---|---|
| Word count | sd | ave_sentm | Word count | sd | Ave_sentm | |
| Min | 0 | 0.00 | − 0.716 | 0 | 0.00 | − 0.707 |
| Median | 10 | 0.00 | 0.00 | 12 | 0.00 | 0.00 |
| Mean | 13.8 | 0.04 | − 0.006 | 16.2 | 0.04 | − 0.005 |
| Max | 407 | 1.14 | 1.395 | 419 | 0.65 | 1.223 |
element_id denotes the individual comments by the students; word_count represents the number of words in each comment; sd represents standard deviation of the sentiment scores in the comments; ave_sentm is the average sentiment score for the individual comments
Fragment of the emotional valence scores expressed by the students towards the teachers analyzed by gender
| Emotional valence scores | ||
|---|---|---|
| Comments | (Male students > > > teachers) | (Female students > > > teachers) |
| [1] | 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 | 1 0 0 0 0 0 0 0 0 0 0 0 0 0 1 |
| [16] | 0 0 − 1 0 0 0 0 0 0 0 0 0 0 0 0 | 0 0 0 0 0 0 0 0 0 − 1 0 0 0 0 0 |
| [31] | 0 0 1 0 0 0 0 − 1 − 1 0 − 1 0 0 0 0 | 0 0 0 0 0 0 0 0 0 − 3 − 1 0 0 0 0 |
| [46] | 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 | 0 0 0 0 0 0 − 1 0 0 0 0 0 0 1 0 |
| [61] | 0 − 1 0 0 0 0 0 0 0 0 0 0 0 0 0 | 0 0 − 1 0 0 0 0 0 0 0 0 0 0 0 0 |
| [76] | 4 0 0 0 0 0 0 0 1 0 0 0 2 0 0 | 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 |
| [91] | 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 | 0 0 0 − 1 0 0 0 0 0 − 2 0 0 0 0 − 1 |
| [106] | 0 0 0 0 − 1 0 0 0 0 0 0 0 0 1 −1 | 0 0 − 1 0 1 0 − 1 0 0 0 0 0 0 0 0 |
| [---] | --- | --- |
| Min | − 4 | − 4 |
| Median | 0.000 | 0.000 |
| Mean | − 0.012 | − 0.021 |
| Max | 6 | 13 |
Fig. 2Comments (counts) vs. Ave_Sentiment score broken down by students gender
Fig. 3Emotional valence scores for the SET comments broken down by students gender
Fig. 4Overall Emotions expressed by the students for the teachers broken down by gender of the students
Fig. 5Emotions expressed by the male students for the male vs female teachers
Fig. 6Emotions expressed by the female students for the male vs female teachers
Summary of the Emotional Valence expressed by the students towards the teachers broken down by gender
| EV | Male teachers | Female teachers | ||||||
|---|---|---|---|---|---|---|---|---|
| Min | Median | Mean | Max | Min | Median | Mean | Max | |
| Male students | − 4 | 0.00 | − 0.008 | 6 | − 3 | 0.00 | − 0.019 | 6 |
| Female students | − 3 | 0.00 | − 0.017 | 10 | − 4 | 0.00 | − 0.027 | 13 |
EV emotional valence, min minimum, max maximum
Analysis of Covariance for the Recommendation of professors by the students
| Analysis of Covariance (ANCOVA) | ||||
|---|---|---|---|---|
| Mean square | F | p-value (Sig.) | Partial Eta Sq | |
| Student_gender | 14.876 | 3.908 | .048** | .000 |
| Ave_sentiment | 210.602 | 55.324 | .000** | .001 |
| Emotional_valence | 2.983 | 0.784 | .376 | .000 |
| Student_gender*Ave_sentiment*Emotional_valence | 27.133 | 7.127 | .001** | .000 |
| Marginal mean effect (ANCOVA) | 14.892 | 3.912 | .048** | .000 |
The factors marked with asterisk (*) were the found elements or considerations that came out statistically significant in the ANCOVA test
DV dependent variable, IV independent variable
Significance levels p ≤ .05
Kruskal–Wallis test for recommendation of the teachers broken down by students’ gender
| Kruskal–Wallis test (Teachers’ Recommendation vs Students gender) IV = Ave_sentiment*Emotional_valence, | |||||
|---|---|---|---|---|---|
| SET Data | Factor | Mean | Std. Dev | H (X2) | p-value (Sig.) |
| Male students ( | Ave_sentiment | − 0.006 | 0.074 | 15.238 | .124 |
| Emotional_valence | − 0.010 | 0.394 | 14.190 | .164 | |
| Female students ( | Ave_sentiment | − 0.005 | 0.076 | 35.389 | .000* |
| Emotional_valence | − 0.020 | 0.441 | 5.774 | .834 | |
IV independent variable
Significance levels p ≤ .05
Kruskal Wallis test for recommendation of teachers analyzed by students' and teachers' gender
| Kruskal Wallis test for Recommendation of the teachers (DV), analyzed by students’ gender vs gender of the teachers | ||||||
|---|---|---|---|---|---|---|
| Factor | Mean | Std. Dev | H (X2) | |||
| Male students ( | Male teachers ( | Ave_sentiment | − 0.006 | 0.069 | 8.644 | .566 |
| Emotional_valence | −0.010 | 0.402 | 17.953 | .056 | ||
| Female teachers ( | Ave_sentiment | − 0.007 | 0.080 | 16.889 | .077 | |
| Emotional_valence | − 0.020 | 0.381 | 9.661 | .471 | ||
| Female students ( | Male teachers ( | Ave_sentiment | − 0.004 | 0.070 | 23.425 | .009* |
| Emotional_valence | − 0.020 | 0.446 | 8.594 | .571 | ||
| Female teachers ( | Ave_sentiment | − 0.007 | 0.083 | 29.717 | .001* | |
| Emotional_valence | − 0.030 | 0.435 | 11.151 | .346 | ||
DV dependent variable, IV independent variable
Significance levels p ≤ .05
Confusion matrix (performance metrics) for optimal k-value, knn = 31
| Predicted scores | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Actual scores | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
| 0 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 1 | 1 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 2 | 2 | 0 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 3 | 0 | 0 | 0 | 8 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| 4 | 0 | 0 | 0 | 0 | 9 | 2 | 0 | 0 | 0 | 0 | 0 |
| 5 | 0 | 0 | 0 | 0 | 0 | 15 | 0 | 0 | 0 | 0 | 0 |
| 6 | 0 | 0 | 0 | 0 | 0 | 0 | 9 | 0 | 0 | 0 | 0 |
| 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 16 | 0 | 0 | 0 |
| 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 25 | 0 | 0 |
| 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 28 | 0 |
| 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 165 |
Accuracy = 1.00 (100%), 95% CI = (0.98, 1), p <.00
Optimal KNN model (knn.31) classifier and statistics for the different classes of recommendation (scores) by the students
| knn.31 model classifier statistics and performance metrics for the optimal | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Performance criteria | Score0 | Score1 | Score2 | Score3 | Score4 | Score5 | Score6 | Score7 | Score8 | Score9 | Score10 | Ave_met (knn.31) |
| Recall (sensitivity) | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| Specificity | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| Precision (pos. pred. value) | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| Negative pred. value | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| Prevalence of score | 0.03 | 0.02 | 0.03 | 0.02 | 0.03 | 0.05 | 0.03 | 0.05 | 0.08 | 0.09 | 0.55 | 0.08 |
| Detection rate | 0.03 | 0.02 | 0.03 | 0.02 | 0.03 | 0.05 | 0.03 | 0.05 | 0.08 | 0.09 | 0.55 | 0.08 |
| Detection prevalence | 0.03 | 0.02 | 0.03 | 0.02 | 0.03 | 0.05 | 0.03 | 0.05 | 0.08 | 0.09 | 0.55 | 0.08 |
| Balanced accuracy | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
Values are represented in percentage (%) where 0.00 equals 0% and 1.00 equals 100%
K-fold cross-validation of the KNN models used for predicting the recommendation scores by the students
| K-fold Cross validation method and results of the KNN models classifications and performance metrics | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Performance criteria | knn.26 | knn.27 | knn.28 | knn.29 | knn.30 | knn.31 | knn.32 | knn.33 | knn.34 | knn.35 | knn.36 |
| Recall (sensitivity) | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 0.96 | 0.94 | 0.88 | 0.88 |
| Specificity | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 0.99 | 0.99 | 0.99 | 0.99 |
| Precision (positive pred. value) | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 0.95 | 1.00 | 1.00 | 1.00 |
| Negative pred. value | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 0.99 | 0.99 | 0.99 | 0.99 |
| Prevalence of score | 0.08 | 0.08 | 0.08 | 0.08 | 0.08 | 0.08 | 0.08 | 0.08 | 0.08 | 0.08 | 0.08 |
| Detection rate | 0.08 | 0.08 | 0.08 | 0.08 | 0.08 | 0.08 | 0.08 | 0.08 | 0.08 | 0.08 | 0.08 |
| Detection prevalence | 0.08 | 0.08 | 0.08 | 0.08 | 0.08 | 0.08 | 0.08 | 0.08 | 0.08 | 0.08 | 0.08 |
| Balanced accuracy | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 0.98 | 0.97 | 0.94 | 0.94 |
| .000 | .000 | .000 | .000 | .000 | .000 | .000 | .000 | .000 | .000 | .000 | |
| Cohen’s Kappa (expected at 0.9) | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 0.98 | 0.95 | 0.91 | 0.91 |
| Conf. Int (CI): 95% | (0.98, 1) | (0.98, 1) | (0.98, 1) | (0.98, 1) | (0.98, 1) | (0.98, 1) | (0.98, 1) | (0.96, 0.99) | (0.94, 0.98) | (0.91, 0.96) | (0.91, 0.96) |
Values are represented in percentage (%) where 0.00 equals 0% and 1.00 equals 100%