| Literature DB >> 29928017 |
Mengyun Cao1,2, Xiaoping Sun2, Hai Zhuge1,2,3.
Abstract
The Semantic Link Network is a general semantic model for modeling the structure and the evolution of complex systems. Various semantic links play different roles in rendering the semantics of complex system. One of the basic semantic links represents cause-effect relation, which plays an important role in representation and understanding. This paper verifies the role of the Semantic Link Network in representing the core of text by investigating the contribution of cause-effect link to representing the core of scientific papers. Research carries out with the following steps: (1) Two propositions on the contribution of cause-effect link in rendering the core of paper are proposed and verified through a statistical survey, which shows that the sentences on cause-effect links cover about 65% of key words within each paper on average. (2) An algorithm based on syntactic patterns is designed for automatically extracting cause-effect link from scientific papers, which recalls about 70% of manually annotated cause-effect links on average, indicating that the result adapts to the scale of data sets. (3) The effects of cause-effect link on four schemes of incorporating cause-effect link into the existing instances of the Semantic Link Network for enhancing the summarization of scientific papers are investigated. The experiments show that the quality of the summaries is significantly improved, which verifies the role of semantic links. The significance of this research lies in two aspects: (1) it verifies that the Semantic Link Network connects the important concepts to render the core of text; and, (2) it provides an evidence for realizing content services such as summarization, recommendation and question answering based on the Semantic Link Network, and it can inspire relevant research on content computing.Entities:
Mesh:
Year: 2018 PMID: 29928017 PMCID: PMC6013162 DOI: 10.1371/journal.pone.0199303
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1The general architecture of this research.
The experimental datasets used in this paper and in our previous work.
| Dataset Name | Brief Description | Paper | Annotated Paper | Annotations |
|---|---|---|---|---|
| The journal paper dataset used in [ | 13 | 3 | ||
| The expanded journal paper dataset | 39 | 9 | 1. | |
| Annotated papers within the | 9 | 9 | 1. | |
| The conference paper dataset collected from proceedings of ACL 2014 | 173 | 0 | (No manually annotation) |
The intensity of annotated cause-effect links within each paper of the OBSERVATION dataset.
| Article ID | S_num | CE_num | CE_rate |
|---|---|---|---|
| f0001 | 712 | 92 | 7.74 |
| f0002 | 167 | 28 | 5.96 |
| f0003 | 1106 | 145 | 7.63 |
| f0014 | 102 | 36 | 2.83 |
| f0015 | 194 | 34 | 5.71 |
| f0016 | 162 | 35 | 4.63 |
| f0027 | 96 | 23 | 4.17 |
| f0028 | 279 | 48 | 5.81 |
| f0029 | 241 | 36 | 6.69 |
| 339.89 | 53 | 5.69 |
The distribution of cause-effect link on the sections of paper f0001.
| Section Title | Sentence Number | Cover Rate (%) |
|---|---|---|
| Abstract | 12 | 25 |
| 1. Introduction | 118 | 9.32 |
| 2. Multi-Dimensional Methodology | 36 | 5.56 |
| 3. Basic Characteristics and Principles … | 70 | 98.60 |
| 4. General Citation–Definition … | 81 | 27.16 |
| 5. Dimension of Representation | 99 | 14.14 |
| 6. Multi-Dimensional Evaluation | 38 | 23.68 |
| 7. Incorporating pictures into summary | 77 | 25.97 |
| 8. Summarizing Videos, Graphs and Pictures | 75 | 21.33 |
| 9. General Summarization | 93 | 21.51 |
| 10. Conclusion | 13 | 7.70 |
The key words coverage of the annotated cause-effect links.
| Article ID | Abstract (%) | Conclusion (%) | Abs&Conc (%) |
|---|---|---|---|
| f0001 | 84.17 | 69.84 | 76.83 |
| f0002 | 79.03 | 54.12 | 64.63 |
| f0003 | 85.03 | 87.23 | 86.39 |
| f0014 | 0 | 0 | 0 |
| f0015 | 0 | 74.29 | 74.29 |
| f0016 | 58.21 | 0 | 58.21 |
| f0027 | 38.89 | 0 | 38.89 |
| f0028 | 0 | 74.04 | 74.04 |
| f0029 | 50 | 43.82 | 46.99 |
| Average | 65.89 | 67.29 | 65.07 |
The position of cause component and the effect component within the annotated cause-effect links.
| Article ID | Adjacent (%) | Not-adj&Multi (%) |
|---|---|---|
| f0001 | 80.43 | 19.57 |
| f0002 | 96.43 | 3.57 |
| f0003 | 93.10 | 6.90 |
| f0014 | 91.67 | 8.33 |
| f0015 | 88.24 | 11.76 |
| f0016 | 91.43 | 8.57 |
| f0027 | 91.30 | 8.70 |
| f0028 | 89.58 | 10.42 |
| f0029 | 94.44 | 5.56 |
| Average | 90.74 | 9.26 |
The percentage of the annotated cause-effect links containing causal cues.
| Article ID | Have causal cue (%) | No causal cue (%) |
|---|---|---|
| f0001 | 90.22 | 9.78 |
| f0002 | 89.29 | 10.71 |
| f0003 | 93.79 | 6.21 |
| f0014 | 88.89 | 11.11 |
| f0015 | 76.47 | 23.53 |
| f0016 | 85.71 | 14.29 |
| f0027 | 73.91 | 26.09 |
| f0028 | 81.25 | 18.75 |
| f0029 | 88.89 | 11.11 |
| Average | 85.38 | 14.62 |
The performance of the cause-effect link extraction algorithm on the OBSERVATION dataset.
| Article ID | Precision (%) | Recall (%) | F1-score |
|---|---|---|---|
| f0001 | 42.36 | 85.92 | 56.74 |
| f0002 | 45.24 | 76 | 56.72 |
| f0003 | 39.52 | 75.97 | 51.99 |
| f0014 | 42.5 | 56.67 | 48.57 |
| f0015 | 40 | 80 | 53.33 |
| f0016 | 37.5 | 60 | 46.15 |
| f0027 | 50 | 52.94 | 51.43 |
| f0028 | 48 | 64.86 | 55.17 |
| f0029 | 77.42 | 77.42 | 77.42 |
| Average | 46.95 | 69.98 | 55.28 |