| Literature DB >> 35808330 |
Samina Sarwat1, Naeem Ullah1, Saima Sadiq2, Robina Saleem1, Muhammad Umer3, Ala' Abdulmajid Eshmawi4, Abdullah Mohamed5, Imran Ashraf6.
Abstract
The availability of educational data obtained by technology-assisted learning platforms can potentially be used to mine student behavior in order to address their problems and enhance the learning process. Educational data mining provides insights for professionals to make appropriate decisions. Learning platforms complement traditional learning environments and provide an opportunity to analyze students' performance, thus mitigating the probability of student failures. Predicting students' academic performance has become an important research area to take timely corrective actions, thereby increasing the efficacy of education systems. This study proposes an improved conditional generative adversarial network (CGAN) in combination with a deep-layer-based support vector machine (SVM) to predict students' performance through school and home tutoring. Students' educational datasets are predominantly small in size; to handle this problem, synthetic data samples are generated by an improved CGAN. To prove its effectiveness, results are compared with and without applying CGAN. Results indicate that school and home tutoring combined have a positive impact on students' performance when the model is trained after applying CGAN. For an extensive evaluation of deep SVM, multiple kernel-based approaches are investigated, including radial, linear, sigmoid, and polynomial functions, and their performance is analyzed. The proposed improved CGAN coupled with deep SVM outperforms in terms of sensitivity, specificity, and area under the curve when compared with solutions from the existing literature.Entities:
Keywords: CGAN; SVM; educational data; predicting student performance; tutoring
Mesh:
Year: 2022 PMID: 35808330 PMCID: PMC9269278 DOI: 10.3390/s22134834
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Attributes of students from dataset.
| Attribute | Description | Domain |
|---|---|---|
| sex | student’s gender | binary: female or male |
| age | student’s age | numeric: from 15 to 22 |
| school | student’s school | binary: Gabriel Pereira or Mousinho da Silveira |
| address | student’s home address | binary: urban or rural |
| Pstatus | parent’s cohabitation status | binary: living together or apart |
| Medu | mother’s education | numeric: from 0 to 4 |
| Mjob | mother’s job | nominal |
| Fedu | father’s education | numeric: from 0 to 4 |
| Fjob | father’s job | nominal |
| guardian | student’s guardian | nominal: mother, father, or other |
| famsize | family size | binary: ≤3 or >3 |
| famrel | quality of family relationships | numeric: from 1—very bad to 5—excellent |
| reason | reason to choose this school | nominal: close to home, school reputation, course preference, or other |
| traveltime | home to school travel time | numeric: 1 ≤ 15 min, 2–15 to 30 min, 3–30 min. to 1 h or 4 ≥ 1 h |
| studytime | weekly study time | numeric: 1 ≤ 2 h, 2–2 to 5 h, 3–5 to 10 h or 4 ≥ 10 h |
| failures | number of past class failures | numeric: n if 1 ≤ n < 3, else 4 |
| schoolsup | extra educational school support | binary: yes or no |
| famsup | family educational support | binary: yes or no |
| activities | extra-curricular activities | binary: yes or no |
| paidclass | extra paid classes | binary: yes or no |
| internet | Internet access at home | binary: yes or no |
| nursery | attended nursery school | binary: yes or no |
| higher | wants to take higher education | binary: yes or no |
| romantic | with a romantic relationship | binary: yes or no |
| freetime | free time after school | numeric: from 1—very low to 5—very high |
| goout | going out with friends | numeric: from 1—very low to 5—very high |
| Walc | weekend alcohol consumption | numeric: from 1—very low to 5—very high |
| Dalc | workday alcohol consumption | numeric: from 1—very low to 5—very high |
| health | current health status | numeric: from 1—very bad to 5—very good |
| absences | number of school absences | numeric: from 0 to 93 |
| G1 | first period grade | numeric: from 0 to 20 |
| G2 | second period grade | numeric: from 0 to 20 |
| G3 | final grade | numeric: from 0 to 20 |
Figure 1Architectural diagram of existing CGAN.
Figure 2Architecture of existing InfoGAN model.
Figure 3Architecture of proposed improved CGAN model.
Figure 4Architecture of proposed deep SVM.
Performance of deep SVM without using improved CGAN.
| Tutoring Method | Sensitivity | Specificity | AUC |
|---|---|---|---|
| School tutoring | 91.8% | 91.8% | 90.9% |
| Home tutoring | 92.2% | 92.7% | 91.1% |
| Combined tutoring | 95.2% | 94.7% | 92.9% |
Performance of deep SVM when used with Improved CGAN.
| Tutoring Method | Sensitivity | Specificity | AUC |
|---|---|---|---|
| School tutoring | 95.1% | 95.1% | 93.2% |
| Home tutoring | 91.9% | 91.2% | 91.4% |
| Combined tutoring | 98.2% | 97.1% | 96.2% |
Performance of deep neural network models when used with Improved CGAN.
| Tutoring Method | Sensitivity | Specificity | AUC |
|---|---|---|---|
| CNN | |||
| School Tutoring | 92.5% | 94.3% | 92.1% |
| Home Tutoring | 89.9% | 90.3% | 90.7% |
| Combined Tutoring | 97.3% | 96.0% | 96.0% |
| LSTM | |||
| School Tutoring | 90.4% | 91.8% | 90.1% |
| Home Tutoring | 88.4% | 89.4% | 88.0% |
| Combined Tutoring | 96.5% | 94.3% | 92.7% |
Figure 5Performance of the proposed approach on school tutoring.
Figure 6Performance of the proposed approach on home tutoring.
Figure 7Performance of the proposed approach on combined tutoring.
Figure 8Performance of the proposed approach using multiple kernels and typical kernel functions.
Results of machine learning models for school tutoring.
| Model | Sensitivity | Specificity | AUC |
|---|---|---|---|
| RF | 91.3 | 90.7 | 91.5 |
| LR | 92.2 | 93.4 | 92.8 |
| ETC | 91.3 | 94.1 | 91.8 |
| GBM | 88.4 | 89.3 | 87.2 |
| SGD | 89.6 | 89.5 | 90.5 |
Results of machine learning models for home tutoring scenario.
| Model | Sensitivity | Specificity | AUC |
|---|---|---|---|
| RF | 88.3 | 88.2 | 88.1 |
| LR | 87.1 | 87.2 | 87.6 |
| ETC | 89.7 | 88.5 | 88.8 |
| GBM | 87.7 | 86.4 | 88.4 |
| SGD | 88.6 | 85.7 | 89.5 |
Performance of models regarding combined tutoring.
| Model | Sensitivity | Specificity | AUC |
|---|---|---|---|
| RF | 94.1 | 95.4 | 94.5 |
| LR | 93.4 | 94.3 | 94.6 |
| ETC | 94.7 | 94.5 | 93.8 |
| GBM | 90.2 | 90.4 | 91.1 |
| SGD | 95.8 | 94.2 | 93.5 |
Figure 9Performance comparison with existing state-of-the-art approaches. (Note that reference numbers in the figure are wrong).