| Literature DB >> 35062654 |
Kazuyuki Matsumoto1, Manabu Sasayama2, Taiga Kirihara1.
Abstract
Currently, task-oriented dialogue systems that perform specific tasks based on dialogue are widely used. Moreover, research and development of non-task-oriented dialogue systems are also actively conducted. One of the problems with these systems is that it is difficult to switch topics naturally. In this study, we focus on interview dialogue systems. In an interview dialogue, the dialogue system can take the initiative as an interviewer. The main task of an interview dialogue system is to obtain information about the interviewee via dialogue and to assist this individual in understanding his or her personality and strengths. In order to accomplish this task, the system needs to be flexible and appropriate for detecting topic switching and topic breaks. Given that topic switching tends to be more ambiguous in interview dialogues than in task-oriented dialogues, existing topic modeling methods that determine topic breaks based only on relationships and similarities between words are likely to fail. In this study, we propose a method for detecting topic breaks in dialogue to achieve flexible topic switching in interview dialogue systems. The proposed method is based on multi-task learning neural network that uses embedded representations of sentences to understand the context of the text and utilizes the intention of an utterance as a feature. In multi-task learning, not only topic breaks but also the intention associated with the utterance and the speaker are targets of prediction. The results of our evaluation experiments show that using utterance intentions as features improves the accuracy of topic separation estimation compared to the baseline model.Entities:
Keywords: dialogue analysis; interview dialogue; topic segmentation
Mesh:
Year: 2022 PMID: 35062654 PMCID: PMC8780003 DOI: 10.3390/s22020694
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Speech intention tag.
| Main Tag | Sub-Tag | Sub-Tag 2 |
|---|---|---|
| “q”: Question | Pre-arranged question (“q1”) | Nest (“nest”) |
| Opinion (“q2”) | ||
| Others (“q3”) | ||
| “a”: Answer | Agreement/Disagreement (“a1”) | Nest (“nest”) |
| Self-disclosure (opinion) (“a2”) | ||
| Self-disclosure (others) (“a3”) | ||
| Explanation of another person (opinion) (“a4”) | ||
| Explanation of another person (others) (“a5”) | ||
| “f”: Feedback | Compliment (“f1”) | Empathy (“kyo”) |
| Help (“f2”) | ||
| Asking back/Repetition (“f3”) | ||
| Others (“f4”) | ||
| “c”: Comment | Compliment (“c1”) | Empathy (“kyo”) |
| Others (“c2”) | ||
| “b1”: Explanation to audience | ||
| “b2”: Filler | ||
| “b3”: Laughter | ||
| “b4”: Greeting |
Breakdown of interview dialogue corpus.
| Main Tag | Interviewer | Interviewee |
|---|---|---|
| q | 2319 | 151 |
| a | 337 | 6442 |
| f | 2334 | 393 |
| c | 1548 | 3 |
| b1 | 401 | 6 |
| b2 | 42 | 96 |
| b3 | 173 | 118 |
| b4 | 160 | 238 |
Breakdown of topic breaks for each utterance speaker.
| Speaker | Label (t) | Rate | Label (n) | Rate | Label(t)+Label(n) |
|---|---|---|---|---|---|
| Interviewer | 753 | 0.10 | 6749 | 0.90 | 7502 |
| Interviewee | 84 | 0.01 | 7684 | 0.99 | 7768 |
Figure 1Distribution of similarity for topic sections. (a) Histogram of similarity for adjacent topic sections, (b) Histogram of similarity for all topic sections.
Comparison of word sets between similar adjacent topics.
| Similarity | t-1 | t |
|---|---|---|
| 0.35 | player, | |
| 0.34 | son, | language, but, news, anxiety stage, group, |
Figure 2Two-dimensional plot of feature vector; BERT, SBERT, and DBERT.
Figure 3Input features for binary/three-valued classifier.
Figure 4Triple-task multi-input neural networks for dialogue topic segmentation task.
Training parameters for each model.
| Model | Parameters | Value |
|---|---|---|
| Neural Networks | epochs | 5–100 |
| optimizer | Adam | |
| loss function | categorical crossentropy | |
| SVM | kernel | rbf |
| regularization parameter C | 1.0 | |
| class weight | balanced | |
| Logistic Regression | default parameters | |
| Random Forest | default parameters | |
Experimental results (accuracy).
| Tag | Output (bin/tri) | Task | Feature | Epochs | Accuracy |
|---|---|---|---|---|---|
| without | bin | triple | DBERT | 70 |
|
| multi | DBERT | 90 |
| ||
| single | DBERT | 30 | 0.78 | ||
| tri | triple | DBERT | 90 |
| |
| multi | BERT | 20 | 0.69 | ||
| single | DBERT | 40 | 0.69 | ||
| with | bin | triple | DBERT | 20 | 0.83 |
| multi | DBERT | 20 |
| ||
| single | DBERT | 10 | 0.83 | ||
| tri | triple | DBERT | 20 |
| |
| multi | DBERT | 20 |
| ||
| single | BERT | 10 | 0.75 |
Experimental results (F1 of true label).
| Tag | Output (bin/tri) | Task | Feature | Epochs | F1-score |
|---|---|---|---|---|---|
| without | bin | triple | DBERT | 70 |
|
| multi | DBERT | 80 |
| ||
| single | DBERT | 50 | 0.77 | ||
| tri | triple | BERT | 30 |
| |
| multi | BERT | 20 | 0.73 | ||
| single | BERT | 5 | 0.73 | ||
| with | bin | triple | DBERT | 20 |
|
| multi | DBERT | 20 |
| ||
| single | DBERT | 20 |
| ||
| tri | triple | DBERT | 20 | 0.785 | |
| multi | DBERT | 30 |
| ||
| single | DBERT | 20 | 0.79 |
Accuracy of baseline methods.
| Tag | Output (bin/tri) | Model | Feature | Accuracy |
|---|---|---|---|---|
| without | bin | SVM | BERT |
|
| Logistic Regression | BERT | 0.76 | ||
| Random Forest | BERT | 0.78 | ||
| tri | SVM | BERT |
| |
| Logistic Regression | BERT | 0.75 | ||
| Random Forest | BERT | 0.74 | ||
| with | bin | SVM | BERT |
|
| Logistic Regression | TFIDF | 0.82 | ||
| Random Forest | TFIDF | 0.81 | ||
| tri | SVM | BERT |
| |
| Logistic Regression | BERT |
| ||
| Random Forest | BERT | 0.81 |
F1-score of baseline methods.
| Tag | Output (bin/tri) | Model | Feature | F1-score |
|---|---|---|---|---|
| without | bin | SVM | BERT |
|
| Logistic Regression | BERT | 0.76 | ||
| Random Forest | BERT | 0.79 | ||
| tri | SVM | BERT |
| |
| Logistic Regression | BERT | 0.75 | ||
| Random Forest | BERT | 0.72 | ||
| with | bin | SVM | BERT |
|
| Logistic Regression | DBERT | 0.82 | ||
| Random Forest | BERT | 0.80 | ||
| tri | SVM | BERT |
| |
| Logistic Regression | BERT |
| ||
| Random Forest | BERT | 0.80 |
Comparison of selected features.
| Feature | SVM | Random Forest | Logistic Regression |
|---|---|---|---|
| tf-idf | |||
| tf-idf + speech intention tag vector | you, | yes, everyone, you, |