| Literature DB >> 26448738 |
Haijian Chen1, Dongmei Han2, Yonghui Dai3, Lina Zhao4.
Abstract
In recent years, Massive Open Online Courses (MOOCs) are very popular among college students and have a powerful impact on academic institutions. In the MOOCs environment, knowledge discovery and knowledge sharing are very important, which currently are often achieved by ontology techniques. In building ontology, automatic extraction technology is crucial. Because the general methods of text mining algorithm do not have obvious effect on online course, we designed automatic extracting course knowledge points (AECKP) algorithm for online course. It includes document classification, Chinese word segmentation, and POS tagging for each document. Vector Space Model (VSM) is used to calculate similarity and design the weight to optimize the TF-IDF algorithm output values, and the higher scores will be selected as knowledge points. Course documents of "C programming language" are selected for the experiment in this study. The results show that the proposed approach can achieve satisfactory accuracy rate and recall rate.Entities:
Mesh:
Year: 2015 PMID: 26448738 PMCID: PMC4584052 DOI: 10.1155/2015/123028
Source DB: PubMed Journal: Comput Intell Neurosci
Figure 1The general process of text mining.
Figure 2The frameworks of automatic knowledge points' extraction.
“Knowledge point-teaching content” matrix.
| Knowledge | Teaching content 1 | Teaching content 2 | ⋯ | Teaching content m1 |
|---|---|---|---|---|
| Constant | 2 | 1 | ⋯ | 0 |
| Variable | 8 | 3 | ⋯ | 1 |
| Integer | 3 | 2 | ⋯ | 1 |
| Float | 1 | 1 | ⋯ | 0 |
| Array | 0 | 0 | ⋯ | 0 |
| Function | 0 | 0 | ⋯ | 0 |
| Style | 0 | 6 | 0 | |
| ⋮ | ⋮ | ⋮ | ⋮ | ⋮ |
Figure 3Result of word segmentation and POS tagging.
The results of accuracy rate of two level knowledge points.
| Parameters | The 1st level of knowledge points | The 2nd level of knowledge points |
|---|---|---|
| The expert annotation number of knowledge points | 66 | 258 |
| Extract expert annotation number of knowledge points | 48 | 193 |
| Accuracy rate | 72.7% | 74.8% |
Figure 4The best different threshold value in 1st level of knowledge points.
Figure 5The best different threshold value in 2nd level of knowledge points.
The results contrast.
| The 1st level of | The 2nd level of | |||
|---|---|---|---|---|
| Index | knowledge points | knowledge points | ||
| TF-IDF | AECKP | TF-IDF | AECKP | |
| ExpertsMark | 66 | 66 | 258 | 258 |
| All | 80 | 80 | 250 | 250 |
| Correct | 31 | 48 | 121 | 193 |
| Precision | 47.0% | 72.7% | 46.9% | 74.8% |
| Recall | 38.8% | 60.0% | 48.4% | 77.2% |
|
| 42.5% | 65.7% | 47.6% | 76.0% |
Figure 6The partial educational ontology of C programming.