| Literature DB >> 30964869 |
Yasir Tahir1, Zixu Yang2, Debsubhra Chakraborty1, Nadia Thalmann1, Daniel Thalmann1, Yogeswary Maniam2, Nur Amirah Binte Abdul Rashid2, Bhing-Leet Tan2,3, Jimmy Lee Chee Keong2,4, Justin Dauwels5.
Abstract
Negative symptoms in schizophrenia are associated with significant burden and possess little to no robust treatments in clinical practice today. One key obstacle impeding the development of better treatment methods is the lack of an objective measure. Since negative symptoms almost always adversely affect speech production in patients, speech dysfunction have been considered as a viable objective measure. However, researchers have mostly focused on the verbal aspects of speech, with scant attention to the non-verbal cues in speech. In this paper, we have explored non-verbal speech cues as objective measures of negative symptoms of schizophrenia. We collected an interview corpus of 54 subjects with schizophrenia and 26 healthy controls. In order to validate the non-verbal speech cues, we computed the correlation between these cues and the NSA-16 ratings assigned by expert clinicians. Significant correlations were obtained between these non-verbal speech cues and certain NSA indicators. For instance, the correlation between Turn Duration and Restricted Speech is -0.5, Response time and NSA Communication is 0.4, therefore indicating that poor communication is reflected in the objective measures, thus validating our claims. Moreover, certain NSA indices can be classified into observable and non-observable classes from the non-verbal speech cues by means of supervised classification methods. In particular the accuracy for Restricted speech quantity and Prolonged response time are 80% and 70% respectively. We were also able to classify healthy and patients using non-verbal speech features with 81.3% accuracy.Entities:
Mesh:
Year: 2019 PMID: 30964869 PMCID: PMC6456189 DOI: 10.1371/journal.pone.0214314
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Sample characteristics.
| Schizophrenia sample | Healthy Control Sample | ||
|---|---|---|---|
| Gender (male:female) | 25:29 | 12:14 | 0.990 |
| Age (years) | 31.06 ± 7.52 | 29.58 ± 8.09 | 0.424 |
| Total years of education | 13.67 ± 2.76 | 13.53 ± 2.23 | 0.825 |
| Duration of illness | 9.06 ± 7.36 | - | |
| Age of illness onset | 22.56 ± 5.30 | - | |
| Medication (%) | |||
| Antipsychotics | 94.44 | ||
| Typical antipsychotics | 7.41 | - | |
| Atypical antipsychotics | 81.48 | - | |
| Anticholinergics | 25.93 | - | |
| Antidepressants | 31.48 | - | |
| Mood Stabilizers | 25.93 | - | |
| Benzodiazepine | 14.81 | - | |
| CPZ equivalence (mg/day) | 412.19 ± 352.01 | - | |
| BPRS Total Score | 32.81 ± 8.86 | 19.81 ± 1.86 | < 0.001 |
| NSA Total Score | 41.28 ± 9.39 | 26.77 ± 3.77 | < 0.001 |
| NSA—Communication Domain Score | 7.96 ± 3.38 | 4.46 ± 0.71 | < 0.001 |
| NSA—Emotion Affect Domain Score | 8.65 ± 1.99 | 6.04 ± 1.73 | < 0.001 |
| NSA—Social Involvement Domain Score | 9.02 ± 2.67 | 7.04 ± 2.01 | < 0.001 |
| NSA—Motivation Domain Score | 11.85 ± 2.68 | 6.92 ± 1.94 | < 0.001 |
| NSA—Retardation Domain Score | 3.80 ± 1.81 | 2.31 ± 0.47 | < 0.001 |
CPZ = Chloropromazine;
BPRS = The Brief Psychiatric Rating Scale;
NSA = Negative Symptom Assessment
NSA items and their explanations.
| Label | Criteria | Explanation |
|---|---|---|
| NSA 1 | Prolonged time to respond | After asking the subject a question, he/she pauses for inappropriately long periods before answering |
| NSA 2 | Restricted speech quantity | Ratings on this item suggest that the subject gives brief answers to questions and/or provides elaborating details only after the interviewer prods him |
| NSA 3 | Impoverished speech content | The subject may talk a lot or a little but the information conveyed is very limited |
| NSA 4 | Inarticulate speech | The subject’s speech cannot be understood because enunciation is poor |
| NSA 5 | Emotion: Reduced range | Emotion is the feeling content of a person’s inner life. This item assesses the range of emotion experienced by the subject during the last week (or other specified time period) |
| NSA 6 | Affect: Reduced modulation of intensity | This item assesses the subject’s modulations of intensity of affect shown during the interview while discussing matters that would be expected to elicit significantly different affective intensities in a normal person |
| NSA 7 | Affect: Reduced display on demand | This items assesses the subject’s ability to display a range of affect as expressed by changes in his/her facial expression and gestures when asked by the interviewer to show how his/her face appears when he/she feels happy, sad, proud, scared, surprised, and angry |
| NSA 8 | Reduced social drive | This item assesses how much the subject desires to initiate social interactions. Desire may be measured in part by the number of actual or attempted social contacts with others |
| NSA 9 | Poor rapport with interviewer | This item assesses the interviewer’s subjective sense that he/she and the subject are actively engaged in communication with one another |
| NSA 10 | Interest in emotional and physical intimacy | This item assesses how much the subject retains interest in emotional and physical intimacy or sexual activity |
| NSA 11 | Poor grooming and hygiene | The subject presents with poorly groomed hair, dishevelled clothing, etc. |
| NSA 12 | Reduced sense of purpose | This item assesses whether the subject possesses integrated goals for his/her life |
| NSA 13 | Reduced interests | This item assesses the range and intensity of the subject’s interests |
| NSA 14 | Reduced daily activity | This item assesses the level of the subject’s daily activity and his/her failure to take advantage of the opportunities his/her environment offers |
| NSA 15 | Reduced expressive gestures | Gestures and body movements that normally facilitate communication during speech are less than normal, or are not observed at all |
| NSA 16 | Slowed movements | This item assesses how much the subject’s voluntary movements are slowed. At a minimum, one should rate movements as gait and those of rising from a chair |
| NSA 17 | Global negative symptoms rating | This item assesses the overall impression of negative symptoms in the subject |
| NSA 18 | NSA total | Sum of the ratings from questions 1-16 |
| NSA 19 | NSA communication | Sum of the ratings from questions 1-4 |
| NSA 20 | NSA emotion affect | Sum of the ratings from questions 5-7 |
| NSA 21 | NSA social involvement | Sum of the ratings from questions 8-10 |
| NSA 22 | NSA motivation | Sum of the ratings from questions 11-14 |
| NSA 23 | NSA retardation | Sum of the ratings from questions 15-16 |
List of conversational, and prosodic features.
| Category | Features |
|---|---|
| Speaking duration | Speaking %, Mutual silence, Difference in Speaking %, Overlap, Response time |
| Speaking turns | Natural turns, Turn duration |
| Interruption | Interruptions, Failed interruptions |
| Interjection | Interjection, Speaking interjection |
| Frequencies | Larynx frequency (F0), Formant (F1, F2, F3) |
| MFCC | Mel-frequency cepstral coefficients |
| Amplitude | Mean volume, Max volume, Min volume, Entropy |
Fig 1Illustration of conversational cues or features.
Periods of speaking and non-speaking are indicated in black and white respectively.
Explanation of non-verbal conversational cues.
| Non-Verbal Feature | Description |
|---|---|
| Natural Turn-Taking | The number of times person ‘A’ speaks in the conversation without interrupting person ‘B’ (see |
| Turn Duration | The average duration of a speaker’s turn. |
| Speaking % | The percentage of time a person speaks in the conversation. |
| Speaking % Difference | The difference between the speaking percentages of both speakers. |
| Mutual Silence % | The percentage of time when both participants are silent. |
| Interruption | Person ‘A’ interrupts person ‘B’ while speaking, and takes over. Person ‘B’ stops speaking before person ‘A’ does (see |
| Speaking Interjection | Short utterances such as ‘okay’, ‘hmm’ etc. when other speaker is speaking (see |
| Speech Gap | The gap that a person takes between his/her consecutive turns. |
| Response Time | If person ‘A’ finishes speaking, then the time taken for person ‘B’ to start speaking is called response time. |
Fig 2Colormap plots of NSA-16 ratings.
Colormap plots between (a) NSA-16 features 1-9 and conversational features, (b) NSA-16 features 10-16 and conversational features, and (c) NSA-16 features 17-23 and conversational features.
Correlation values between NSA and speech features.
| Natural Turn | Difference Turn | Interject | Speaking Interject | Interrupt | Failed Interrupt | Overlap | Speaking | Difference Speaking | Turn Duration | Speaking Rate | Mutual Silence | Speech Gap | Response Time | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| NSA 1 | -0.074 | -0.039 | -0.229 | 0.195 | 0.241 | 0.091 | 0.169 | -0.081 | -0.076 | -0.174 | -0.155 | 0.117 | 0.245 | 0.307 |
| (0.593) | (0.782) | (0.096) | (0.158) | (0.079) | (0.515) | (0.223) | (0.563) | (0.584) | (0.209) | (0.265) | (0.399) | (0.075) | (0.024) | |
| NSA 2 | -0.276 | -0.176 | 0.05 | 0.025 | 0.154 | -0.035 | 0.098 | -0.437 | -0.178 | -0.461 | 0.135 | 0.404 | 0.149 | 0.355 |
| (0.044) | (0.204) | (0.718) | (0.859) | (0.267) | (0.803) | (0.482) | (0.001) | (0.197) | (0.001) | (0.331) | (0.002) | (0.283) | (0.008) | |
| NSA 3 | -0.044 | -0.055 | -0.309 | 0.177 | 0.277 | 0.246 | 0.296 | -0.027 | -0.102 | -0.227 | -0.125 | 0.096 | 0.129 | 0.29 |
| (0.754) | (0.691) | (0.023) | (0.199) | (0.043) | (0.073) | (0.03) | (0.844) | (0.463) | (0.099) | (0.366) | (0.491) | (0.351) | (0.033) | |
| NSA 4 | -0.195 | -0.205 | -0.265 | -0.016 | 0.088 | 0.137 | 0.255 | -0.163 | -0.24 | -0.2 | -0.146 | 0.119 | 0.054 | 0.216 |
| (0.158) | (0.137) | (0.053) | (0.908) | (0.527) | (0.324) | (0.063) | (0.238) | (0.081) | (0.147) | (0.293) | (0.39) | (0.697) | (0.116) | |
| NSA 5 | -0.005 | 0.132 | 0.138 | -0.14 | -0.112 | -0.105 | -0.121 | -0.081 | 0.16 | -0.061 | 0.185 | 0.172 | -0.051 | 0.192 |
| (0.973) | (0.34) | (0.318) | (0.311) | (0.42) | (0.448) | (0.383) | (0.559) | (0.247) | (0.66) | (0.18) | (0.214) | (0.715) | (0.164) | |
| NSA 6 | -0.044 | -0.048 | -0.108 | 0.074 | 0.144 | -0.012 | 0.126 | -0.172 | -0.165 | -0.323 | 0.237 | 0.157 | 0.153 | 0.226 |
| (0.752) | (0.731) | (0.439) | (0.596) | (0.3) | (0.933) | (0.362) | (0.213) | (0.232) | (0.017) | (0.085) | (0.258) | (0.268) | (0.1) | |
| NSA 7 | -0.349 | -0.326 | 0.053 | 0.102 | 0.019 | 0.144 | 0.248 | -0.326 | -0.14 | -0.183 | 0.036 | 0.342 | 0.187 | 0.285 |
| (0.01) | (0.016) | (0.703) | (0.462) | (0.889) | (0.299) | (0.071) | (0.016) | (0.314) | (0.186) | (0.796) | (0.011) | (0.175) | (0.036) | |
| NSA 8 | -0.224 | -0.076 | -0.01 | 0.095 | -0.056 | -0.019 | 0.071 | -0.224 | -0.139 | -0.162 | -0.045 | 0.119 | 0.067 | 0.092 |
| (0.103) | (0.586) | (0.944) | (0.493) | (0.688) | (0.892) | (0.609) | (0.104) | (0.316) | (0.241) | (0.747) | (0.393) | (0.631) | (0.509) | |
| NSA 9 | -0.027 | 0.059 | -0.073 | -0.092 | 0.147 | 0.027 | 0.081 | -0.07 | 0.014 | -0.17 | -0.158 | 0.219 | 0.137 | 0.226 |
| (0.846) | (0.671) | (0.599) | (0.508) | (0.29) | (0.846) | (0.559) | (0.613) | (0.918) | (0.219) | (0.255) | (0.112) | (0.323) | (0.1) | |
| NSA 13 | -0.068 | -0.064 | -0.105 | 0.062 | 0.091 | 0.072 | 0.171 | -0.057 | -0.105 | -0.151 | -0.019 | 0.054 | -0.198 | 0.147 |
| (0.625) | (0.648) | (0.451) | (0.656) | (0.514) | (0.605) | (0.218) | (0.684) | (0.449) | (0.275) | (0.893) | (0.698) | (0.15) | (0.288) | |
| NSA 14 | -0.069 | -0.023 | 0.027 | 0.111 | 0.103 | 0.029 | 0.14 | -0.155 | -0.101 | -0.28 | -0.026 | 0.17 | 0.12 | 0.084 |
| (0.618) | (0.87) | (0.848) | (0.426) | (0.459) | (0.834) | (0.314) | (0.263) | (0.465) | (0.04) | (0.853) | (0.22) | (0.387) | (0.548) | |
| NSA 15 | -0.222 | -0.175 | -0.047 | 0.188 | 0.165 | 0.035 | 0.172 | -0.359 | -0.232 | -0.466 | 0.199 | 0.332 | 0.306 | 0.246 |
| (0.107) | (0.205) | (0.734) | (0.172) | (0.234) | (0.801) | (0.214) | (0.008) | (0.092) | (0.001) | (0.149) | (0.014) | (0.025) | (0.073) | |
| NSA 16 | -0.08 | -0.125 | -0.179 | 0.046 | 0.232 | 0.012 | 0.116 | -0.161 | -0.142 | -0.273 | -0.006 | 0.206 | 0.24 | 0.179 |
| (0.564) | (0.367) | (0.196) | (0.742) | (0.091) | (0.93) | (0.402) | (0.244) | (0.304) | (0.046) | (0.963) | (0.135) | (0.08) | (0.196) | |
| NSA Global | -0.242 | -0.168 | -0.082 | 0.099 | 0.188 | 0.018 | 0.161 | -0.273 | -0.171 | -0.311 | 0.02 | 0.271 | 0.118 | 0.161 |
| (0.077) | (0.225) | (0.557) | (0.476) | (0.173) | (0.898) | (0.244) | (0.046) | (0.215) | (0.022) | (0.886) | (0.047) | (0.396) | (0.246) | |
| NSA 18 | -0.24 | -0.157 | -0.131 | 0.109 | 0.248 | 0.105 | 0.263 | -0.301 | -0.217 | -0.432 | 0.027 | 0.307 | 0.168 | 0.369 |
| (0.08) | (0.258) | (0.345) | (0.435) | (0.071) | (0.452) | (0.055) | (0.027) | (0.115) | (0.001) | (0.845) | (0.024) | (0.225) | (0.006) | |
| NSA 19 | -0.201 | -0.157 | -0.259 | 0.148 | 0.278 | 0.151 | 0.282 | -0.248 | -0.2 | -0.377 | -0.099 | 0.264 | 0.215 | 0.42 |
| (0.144) | (0.256) | (0.058) | (0.287) | (0.041) | (0.277) | (0.039) | (0.07) | (0.147) | (0.005) | (0.477) | (0.054) | (0.118) | (0.002) | |
| NSA 20 | -0.181 | -0.112 | 0.03 | 0.021 | 0.032 | 0.012 | 0.122 | -0.271 | -0.076 | -0.274 | 0.22 | 0.31 | 0.14 | 0.329 |
| (0.19) | (0.419) | (0.828) | (0.879) | (0.816) | (0.933) | (0.379) | (0.048) | (0.585) | (0.045) | (0.111) | (0.023) | (0.313) | (0.015) | |
| NSA 21 | -0.221 | -0.112 | -0.075 | -0.051 | 0.145 | 0.03 | 0.12 | -0.21 | -0.173 | -0.254 | -0.043 | 0.188 | -0.026 | 0.236 |
| (0.108) | (0.42) | (0.592) | (0.712) | (0.296) | (0.832) | (0.386) | (0.128) | (0.21) | (0.063) | (0.759) | (0.174) | (0.851) | (0.086) | |
| NSA 22 | -0.107 | -0.036 | 0.002 | 0.131 | 0.197 | 0.118 | 0.239 | -0.118 | -0.129 | -0.281 | 0.014 | 0.11 | 0.022 | 0.115 |
| (0.44) | (0.796) | (0.99) | (0.345) | (0.154) | (0.394) | (0.081) | (0.395) | (0.352) | (0.039) | (0.922) | (0.427) | (0.875) | (0.406) | |
| NSA 23 | -0.186 | -0.178 | -0.121 | 0.147 | 0.225 | 0.029 | 0.171 | -0.317 | -0.224 | -0.443 | 0.127 | 0.321 | 0.321 | 0.251 |
| (0.179) | (0.198) | (0.385) | (0.29) | (0.103) | (0.834) | (0.216) | (0.02) | (0.104) | (0.001) | (0.359) | (0.018) | (0.018) | (0.067) |
Classification results for NSA criteria.
| NSA Criteria | Algorithm | Feature Selection | Confusion Matrix | Accuracy | AUC | Best Features | |
|---|---|---|---|---|---|---|---|
| Non-Observable | Observable | ||||||
| NSA 1 | SVM | Correlation | 33 | 4 | 79.6% | 0.74 | Ent_F2, Ent_F3, Ent_F1 |
| 7 | 10 | Speech_Gap, Response_Time | |||||
| NSA 2 | SVM | Correlation | 26 | 7 | 70.4% | 0.68 | Max_Vol, Mean_Vol, Ent_Freq |
| 9 | 12 | MFCC, Mutual_Silence | |||||
| NSA 3 | SVM | Correlation | 18 | 9 | 59.3% | 0.59 | MFCC2, Overlap, Response_Time |
| 13 | 14 | Failed_Interrupt, MFCC1 | |||||
| NSA 5 | SVM | Correlation | 13 | 12 | 53.7% | 0.54 | Ent_Vol, MFCC8, Max_Vol |
| 13 | 16 | MFCC, Mean_Vol | |||||
| NSA 6 | SVM | Correlation | 13 | 13 | 59.3% | 0.59 | Turn_Duration, Speech_Gap, MFCC5 |
| 9 | 19 | MFCC10, Response_Time | |||||
| NSA 8 | SVM | Correlation | 6 | 7 | 74.1% | 0.65 | MFCC2, Speaking, MFCC4 |
| 7 | 34 | MFCC3, MFCC8 | |||||
| NSA 15 | SVM | CFSsubset | 30 | 5 | 77.8% | 0.74 | Turn_Duration, Ent_Freq, Mutual_Silence |
| 7 | 12 | Max_Vol, Mean_Vol | |||||
Patient vs healthy classification.
| Algorithm | Feature Selection | Confusion Matrix | Accuracy | AUC | Precision | Recall | F-Score | Best Features | |
|---|---|---|---|---|---|---|---|---|---|
| Patient | Healthy | ||||||||
| SVM | ReliefF | 40 | 14 | 70% | 0.68 | 0.8 | 0.74 | 0.77 | speech_gap, ent_freq, mean_vol |
| 10 | 16 | ent_f1, ent_f3 | |||||||
| Random Forest | CFSsubset | 42 | 12 | 72.5% | 0.8 | 0.81 | 0.78 | 0.79 | speaking, speech_gap, ent_freq |
| 10 | 16 | ent_f3, mfcc | |||||||
| MLP | None | 44 | 10 | 81.3% | 0.9 | 0.9 | 0.82 | 0.85 | |
| 5 | 21 | ||||||||
| Ensemble (Bagging) | CFSsubset | 46 | 8 | 77.5% | 0.8 | 0.82 | 0.85 | 0.84 | speaking, speech_gap, ent_freq |
| 10 | 16 | ent_f3, mfcc | |||||||
Fig 3Boxplots for features that are significantly different for healthy subjects and patients.
Boxplots for (a) F1 Entropy, (b) F2 Entropy, (c) F3 Entropy, (d) Frequency Entropy, (e) Volume Entropy, and (f) Speaking Rate.