| Literature DB >> 35529577 |
Dongxuan Wang1, Dapeng Lian2, Yazhou Xing3, Shiying Dong1, Xinyu Sun1, Jia Yu1.
Abstract
To effectively improve students' performance and help educators monitor students' learning situations, many colleges are committed to establishing systems that explore the influencing factors and predict student academic performance. However, because different colleges have different situations, the previous research results may not be applicable to ordinary Chinese colleges. This paper has two main objectives: to analyze the fluctuation of Chinese ordinary college student academic performance and to establish systems to predict performance. First, according to previous research results and the current situation of Chinese college students, a questionnaire was designed to collect data. Second, the chi-square test was used to analyze the contents of the questionnaire and identify the main features. Third, taking the main features as input, four classification prediction models are established by machine learning. Some traits of the students who did not pass all the examinations were also discovered. It might help student counselors and educators to take targeted measures. The experiment shows that the support vector machine classifier (SVC) model has the best and most stable effect. The average recall rate, precision rate, and accuracy rate reached 82.83%, 86.18%, and 80.96%, respectively.Entities:
Keywords: analysis and prediction; college student academic performance; factors analysis; machine learning; the chi-square test
Year: 2022 PMID: 35529577 PMCID: PMC9072789 DOI: 10.3389/fpsyg.2022.881859
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Cross-tab.
| 0 | 1 | Total | |
|---|---|---|---|
| A |
|
|
|
| B |
|
|
|
| C |
|
|
|
| D |
|
|
|
| E |
|
|
|
| Total |
|
|
|
x indicates the frequency of students choosing option .
Confusion matrix.
| True value | Predict value | |
|---|---|---|
| P′ | N′ | |
| P | TP | FN |
| N | FP | TN |
Questions about metacognitive awareness.
| Number | Question |
|---|---|
| Q1 | Are you satisfied with your college entrance examination results? |
| Q2 | How do you feel about your sense of achievement during your freshman year of study? |
| Q3 | Do you think you received enough attention from your teachers during freshman year? |
| Q4 | How often did you feel depressed during your freshman year? |
| Q5 | How do you feel about the stress of your freshman year? |
| Q6 | How much did you absorb what your teacher taught you in class during your freshman year? |
Questions about learning participation skills.
| Number | Question |
|---|---|
| Q16 | What do you think of your schedule? |
| Q17 | How often do you preview before class? |
| Q18 | How often do you review after class? |
| Q19 | How much effective self-study time did you spend on average every day during your freshman year? |
| Q20 | How often do you ask your teacher questions after class? |
| Q21 | How many times did you participate in entrepreneurship and innovation competitions during your freshman year? |
| Q22 | How often did you skip classes or find an excuse to skip class during your freshman year? |
| Q23 | Where did you sit most of the time in the class during your freshman year? |
| Q24 | How long can you concentrate in class? (total 45 min) |
| Q25 | How often did you sleep in class during your freshman year? |
| Q26 | How often did you play with your mobile phone in class during your freshman year? |
| Q27 | How did you finish the homework assigned by the teacher during your freshman year? |
Partial dataset.
| Name | Q1 | Q2 | Q3 | Q4 | Q5 | …… | Q23 | Q24 | Q25 | Q26 | Q27 | Pass_or_fail |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Zhang Jiankun | B | A | A | B | C | … | B | B | A | B | C | 1 |
| Zhang Jinxing | B | B | B | C | B | … | B | B | B | B | B | 0 |
| Ma Xiao | C | B | B | C | A | … | B | B | B | A | A | 0 |
| Su Meiqi | A | B | B | B | C | … | C | C | B | C | B | 1 |
| Fu Haobin | C | B | B | B | A | … | B | B | B | B | B | 1 |
Figure 1An overall academic performance.
Cross-tab statistics of the Q1–Q5 questionnaires.
| Question | Q1 | Q2 | Q3 | Q4 | Q5 | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Option | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 |
| A | 7 | 16 | 63 | 14 | 34 | 22 | 16 | 11 | 12 | 12 |
| B | 40 | 40 | 92 | 58 | 149 | 88 | 149 | 79 | 84 | 33 |
| C | 61 | 39 | 19 | 22 | – | – | 18 | 20 | 78 | 58 |
| D | 61 | 9 | 9 | 16 | – | – | – | – | 9 | 7 |
| E | 14 | 6 | – | – | – | – | – | – | – | – |
Expected frequency T.
| Question | Q1 | Q2 | Q3 | Q4 | Q5 | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Option | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 |
| A | 14.37 | 8.63 | 48.09 | 28.91 | 34.98 | 21.02 | 16.86 | 10.14 | 14.99 | 9.01 |
| B | 49.97 | 30.03 | 93.69 | 56.31 | 148.02 | 88.98 | 142.40 | 85.60 | 73.08 | 43.92 |
| C | 62.46 | 37.54 | 25.61 | 15.39 | – | – | 23.73 | 14.27 | 84.94 | 51.06 |
| D | 43.72 | 26.28 | 15.61 | 9.39 | – | – | – | – | 9.99 | 6.01 |
| E | 12.49 | 7.51 | – | – | – | – | – | – | – | – |
Chi-square test results (per the chi-square test critical value).
| Q |
|
|
|
| 0.05 Critical value | Q |
|
|
|
| 0.05 Critical value |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 34.12 | 7.04E-07 | ** | 4 | 9.488 | 15 | 85.62 | 1.90E-18 | ** | 3 | 7.815 |
| 2 | 24.39 | 2.07E-05 | ** | 3 | 7.815 | 16 | 6.83 | 0.07751 | * | 3 | 7.815 |
| 3 | 0.02 | 0.883851 | – | 1 | 3.841 | 17 | 0.96 | 0.32532 | – | 1 | 3.841 |
| 4 | 4.62 | 0.099183 | – | 2 | 5.991 | 18 | 15.52 | 0.000425 | ** | 2 | 5.991 |
| 5 | 7.71 | 0.052331 | – | 3 | 7.815 | 19 | 7.83 | 0.049628 | * | 3 | 7.815 |
| 6 | 44.51 | 2.16E-10 | ** | 2 | 5.991 | 20 | 19.25 | 1.15E-05 | ** | 1 | 3.841 |
| 7 | 0.98 | 0.612043 | – | 2 | 5.991 | 21 | 0.20 | 0.654369 | – | 1 | 3.841 |
| 8 | 2.18 | 0.335724 | – | 2 | 5.991 | 22 | 63.19 | 1.89E-14 | ** | 2 | 5.991 |
| 9 | 4.41 | 0.10994 | – | 2 | 5.991 | 23 | 55.51 | 8.81E-13 | ** | 2 | 5.991 |
| 10 | 0.81 | 0.665165 | – | 2 | 5.991 | 24 | 32.67 | 3.78E-07 | ** | 3 | 7.815 |
| 11 | 45.34 | 7.80E-10 | ** | 3 | 7.815 | 25 | 4.39 | 0.110819 | – | 2 | 5.991 |
| 12 | 9.26 | 0.002339 | ** | 1 | 3.841 | 26 | 7.87 | 0.019511 | * | 2 | 5.991 |
| 13 | 16.96 | 0.000717 | ** | 3 | 7.815 | 27 | 58.20 | 2.29E-13 | ** | 2 | 5.991 |
| 14 | 4.91 | 0.177763 | – | 3 | 7.815 |
R represents the degree of correlation, and .
Figure 2The flow chart of dataset segmentation.
Figure 3The confusion matrix.
Model prediction results.
| Evaluation criterion | Models | 1 (%) | 2 (%) | 3 (%) | 4 (%) | 5 (%) | 6 (%) | Average (%) |
|
|---|---|---|---|---|---|---|---|---|---|
| Recall | LOG | 75.76 | 75.76 | 78.79 | 81.82 | 75.76 | 78.79 | 77.78 | 0.0247 |
| SVC | 81.82 | 81.82 | 81.82 | 84.85 | 81.82 | 84.85 | 82.83 | 0.0156 | |
| RFC | 81.82 | 69.70 | 75.76 | 75.76 | 72.73 | 78.79 | 75.76 | 0.0429 | |
| BNB | 78.79 | 75.76 | 78.79 | 75.76 | 72.73 | 75.76 | 76.27 | 0.0228 | |
| Accuracy | LOG | 85.23 | 86.36 | 84.09 | 87.50 | 82.95 | 84.09 | 85.04 | 0.0167 |
| SVC | 88.64 | 85.23 | 85.23 | 87.50 | 86.36 | 84.09 | 86.18 | 0.0167 | |
| RFC | 88.64 | 81.82 | 82.95 | 85.23 | 80.68 | 82.95 | 83.71 | 0.0285 | |
| BNB | 87.50 | 79.55 | 78.41 | 81.82 | 80.68 | 75.00 | 80.49 | 0.0415 | |
| Precision | LOG | 83.33 | 86.21 | 78.79 | 84.38 | 78.13 | 78.79 | 81.61 | 0.0346 |
| SVC | 87.10 | 79.41 | 79.41 | 82.35 | 81.82 | 75.68 | 80.96 | 0.0382 | |
| RFC | 86.67 | 73.33 | 76.67 | 82.35 | 75.00 | 76.47 | 78.42 | 0.0506 | |
| BNB | 86.67 | 71.43 | 68.43 | 75.76 | 75.00 | 64.10 | 73.57 | 0.0773 |
Figure 4The ROC curve.
Questions about environmental factors.
| Number | Question |
|---|---|
| Q7 | Where does your family (growth environment) come from? |
| Q8 | What is your parents’ highest educational degree? |
| Q9 | What do you think of the learning atmosphere in your dormitory? |
| Q10 | How do you feel about your relationship with your classmates and roommates? |
Questions about learning motivation.
| Number | Question |
|---|---|
| Q11 | How do you feel about your interest in computer science? |
| Q12 | What do you do when you have a problem with your studies during your freshman year? |
| Q13 | How useful do you think textbook knowledge will be in your future work? |
| Q14 | What do you think is your motivation to study? |
| Q15 | What are your plans for your future? |