Anzar Abbas1, Bryan J Hansen2, Vidya Koesmahargyo1, Vijay Yadav1, Paul J Rosenfield3, Omkar Patil2, Marissa F Dockendorf2, Matthew Moyer2, Lisa A Shipley2, M Mercedez Perez-Rodriguez3, Isaac R Galatzer-Levy1,4.
Abstract
BACKGROUND: Machine learning-based facial and vocal measurements have demonstrated relationships with schizophrenia diagnosis and severity. Demonstrating utility and validity of remote and automated assessments conducted outside of controlled experimental or clinical settings can facilitate scaling such measurement tools to aid in risk assessment and tracking of treatment response in populations that are difficult to engage.
Keywords: computer vision; digital biomarkers; facial expressivity; negative symptoms; phenotyping; vocal acoustics
Year: 2022 PMID: 35060906 PMCID: PMC8817208 DOI: 10.2196/26276
Source DB: PubMed Journal: JMIR Form Res ISSN: 2561-326X
List of vocal acoustic variables extracted from audio files collected during participation in remote smartphone assessments and references to earlier work on their relevance in schizophrenia.
| Variable | Description |
| | Volume of participant’s speech, measured in decibels, which was previously shown to be decreased in individuals with schizophrenia compared to healthy controls [ |
| Fundamental frequency mean | Average fundamental frequency of participant speech in hertz, which has been shown to be higher in individuals with schizophrenia and to decrease in response to treatment [ |
| Fundamental frequency stdev | SD in fundamental frequency in hertz, which has been shown to be greater in individuals with schizophrenia [ |
| Vocal jitter | Degree of irregularity in the frequency of the participant’s speech, measured in hertz, demonstrated to be higher in individuals with schizophrenia [ |
| Speech prevalence | Percentage of the audio file where participant speech was detected as opposed to silence; individuals with schizophrenia demonstrate increased pauses and variability in pause duration [ |
| Harmonics to noise ratio | Quantification of additive noise in the participant’s speech, which has been used to predict risk of psychosis and has been shown to correlate with symptom severity in other neurological disorders such as Parkinson disease [ |
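As an illustration of how several of the variables above could be derived, the following is a minimal sketch, not the pipeline used in the study. It assumes a hypothetical per-frame pitch track in hertz in which `None` marks frames without detected speech; the function name `summarize_pitch_track` and the frame-to-frame approximation of jitter are assumptions made for the example.

```python
from statistics import mean, stdev


def summarize_pitch_track(f0_hz):
    """Summarize a per-frame pitch track (hypothetical input format).

    f0_hz: list of fundamental frequency estimates in hertz, one per frame,
    with None for frames where no speech was detected.
    """
    voiced = [f for f in f0_hz if f is not None]
    if len(voiced) < 2:
        raise ValueError("need at least two voiced frames")

    # Simplified jitter (an assumption for this sketch): mean absolute change
    # in frequency between consecutive voiced frames, in hertz.
    diffs = [
        abs(b - a)
        for a, b in zip(f0_hz, f0_hz[1:])
        if a is not None and b is not None
    ]

    return {
        "f0_mean_hz": mean(voiced),
        "f0_stdev_hz": stdev(voiced),  # sample standard deviation
        "jitter_hz": mean(diffs) if diffs else 0.0,
        # Share of frames with detected speech, as a percentage.
        "speech_prevalence_pct": 100.0 * len(voiced) / len(f0_hz),
    }


track = [100.0, 102.0, None, None, 98.0, 100.0]
summary = summarize_pitch_track(track)
```

Audio intensity and harmonics to noise ratio are left out because they require the waveform itself rather than a pitch track.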
Figure 1. Example screenshots from the smartphone assessment all study participants took for remote and automated collection of video and audio data. During each of the prompts, the app speaks the text displayed on the screen and awaits a verbal and visual response from the participant, all while recording video and audio from the front-facing camera and microphone. (A) Screen displayed before the participant begins the assessment. (B) Prompt for collection of free behavior in response to images, showing one example image. (C) Prompt for collection of evoked facial expression behavior. (D) Prompt for collection of evoked vocal expression behavior.
All variables described in Measurement of Digital Markers were calculated separately for distinct behaviors captured during the remote smartphone assessments. Each of the behaviors that were elicited and captured during the smartphone assessment and the digital markers calculated from those behaviors are listed here.
| Behavior | On-screen prompt | Digital markers measured |
| Free behavior | | Facial expressivity; Fundamental frequency mean; Fundamental frequency stdev; Vocal jitter; Harmonics to noise ratio; Speech prevalence |
| Evoked facial expression | | Facial expressivity |
| Evoked vocal expression | | Fundamental frequency mean; Fundamental frequency stdev; Vocal jitter; Harmonics to noise ratio; Speech prevalence |
Correlation between vocal markers during evoked vocal expression and Positive and Negative Syndrome Scale (PANSS) score showed a relationship between vocal characteristics and schizophrenia symptom severity.
| Variable | Negative symptom severity | Positive symptom severity | General severity | Total | Vocal | Fundamental | Fundamental | Vocal | Speech |
| Negative symptom severity | | | | | | | | | |
| Pearson | — | | | | | | | | |
| P value | — | | | | | | | | |
| Positive symptom severity | | | | | | | | | |
| Pearson | 0.452a | — | | | | | | | |
| P value | .045 | — | | | | | | | |
| General severity | | | | | | | | | |
| Pearson | 0.572b | 0.806c | — | | | | | | |
| P value | .008 | <.001 | — | | | | | | |
| Total | | | | | | | | | |
| Pearson | 0.757c | 0.870c | 0.947c | — | | | | | |
| P value | <.001 | <.001 | <.001 | — | | | | | |
| Vocal | | | | | | | | | |
| Pearson | –0.091 | –0.250 | –0.088 | –0.152 | — | | | | |
| P value | .71 | .90 | .72 | .64 | — | | | | |
| Fundamental | | | | | | | | | |
| Pearson | –0.436 | –0.068 | 0.098 | –0.090 | –0.081 | — | | | |
| P value | .07 | .78 | .83 | .71 | .74 | — | | | |
| Fundamental | | | | | | | | | |
| Pearson | –0.644a | –0.253 | –0.218 | –0.373 | 0.475 | 0.577a | — | | |
| P value | .02 | .30 | .37 | .70 | .10 | .02 | — | | |
| Vocal | | | | | | | | | |
| Pearson | 0.563a | 0.229 | 0.122 | 0.293 | –0.176 | –0.695c | –0.823c | — | |
| P value | .02 | .52 | .93 | .34 | .79 | <.001 | <.001 | — | |
| Speech | | | | | | | | | |
| Pearson | –0.470 | –0.247 | –0.292 | –0.362 | 0.611a | 0.043 | 0.781c | –0.373 | — |
| P value | .06 | .61 | .23 | .38 | .03 | .86 | <.001 | .12 | — |
| | | | | | | | | | |
| Pearson | –0.610a | –0.195 | –0.126 | –0.297 | 0.154 | 0.773c | 0.868c | –0.965c | 0.422 |
| P value | .02 | .51 | .61 | .43 | .66 | <.001 | <.001 | <.001 | .07 |
aP<.05.
bP<.01.
cP<.001.
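The Pearson coefficients and footnote markers in tables of this kind can be reproduced with a short calculation. The following is a minimal sketch, assuming two equal-length lists of per-participant values; the function names `pearson_r` and `significance_marker` are hypothetical, and the P value is taken as an input rather than computed, since the study's exact testing procedure is not described here.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of cross-products and squared deviations from the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / sqrt(sxx * syy)


def significance_marker(p_value):
    """Map a P value to the footnote letters used in the tables (a, b, c)."""
    if p_value < 0.001:
        return "c"
    if p_value < 0.01:
        return "b"
    if p_value < 0.05:
        return "a"
    return ""


# Toy values for illustration only, not study data.
r = pearson_r([1.0, 2.0, 3.0, 4.0], [2.0, 4.0, 6.0, 8.0])
marker = significance_marker(0.045)
```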
Correlation between facial expressivity during evoked facial expression and the Positive and Negative Syndrome Scale score showed a relationship between facial affect and schizophrenia symptom severity.
| Variable | Facial expressivity | Negative symptom severity | Positive symptom severity | General severity |
| Facial expressivity | | | | |
| Pearson | — | | | |
| P value | — | | | |
| Negative symptom severity | | | | |
| Pearson | –0.500a | — | | |
| P value | .04 | — | | |
| Positive symptom severity | | | | |
| Pearson | –0.628b | 0.452a | — | |
| P value | .01 | .045 | — | |
| General severity | | | | |
| Pearson | –0.695b | 0.572b | 0.806c | — |
| P value | .009 | .008 | <.001 | — |
| Total | | | | |
| Pearson | –0.714b | 0.757c | 0.870c | 0.947c |
| P value | .002 | <.001 | <.001 | <.001 |
aP<.05.
bP<.01.
cP<.001.
Correlation between facial and vocal markers during free behavior and PANSS score showed relationships of facial affect and vocal characteristics with schizophrenia symptom severity.
| Variable | Negative symptom severity | Positive symptom severity | General severity | Total | Facial | Vocal | Fundamental frequency | Fundamental frequency | Harmonics to noise ratio | Vocal jitter |
| Negative symptom severity | | | | | | | | | | |
| Pearson | — | | | | | | | | | |
| P value | — | | | | | | | | | |
| Positive symptom severity | | | | | | | | | | |
| Pearson | 0.452a | — | | | | | | | | |
| P value | .045 | — | | | | | | | | |
| General severity | | | | | | | | | | |
| Pearson | 0.572b | 0.806c | — | | | | | | | |
| P value | .008 | <.001 | — | | | | | | | |
| Total | | | | | | | | | | |
| Pearson | 0.757c | 0.870c | 0.947c | — | | | | | | |
| P value | <.001 | <.001 | <.001 | — | | | | | | |
| Facial | | | | | | | | | | |
| Pearson | 0.142 | –0.113 | 0.090 | 0.056 | — | | | | | |
| P value | .56 | .64 | .83 | .82 | — | | | | | |
| Vocal | | | | | | | | | | |
| Pearson | –0.502a | –0.332 | –0.225 | –0.386 | 0.364 | — | | | | |
| P value | .05 | .17 | .83 | .24 | .13 | — | | | | |
| Fundamental frequency | | | | | | | | | | |
| Pearson | –0.606a | –0.288 | –0.268 | –0.428 | 0.184 | 0.935c | — | | | |
| P value | .04 | .81 | .27 | .48 | .45 | <.001 | — | | | |
| Fundamental frequency | | | | | | | | | | |
| Pearson | –0.304 | –0.189 | –0.127 | –0.225 | 0.179 | 0.581b | 0.529a | — | | |
| P value | .24 | .61 | .61 | .50 | .46 | .009 | .02 | — | | |
| Harmonics to noise ratio | | | | | | | | | | |
| Pearson | –0.584a | –0.224 | –0.097 | –0.312 | 0.174 | 0.654b | 0.774c | 0.476a | — | |
| P value | .03 | .62 | .97 | .34 | .48 | .002 | <.001 | .04 | — | |
| Vocal jitter | | | | | | | | | | |
| Pearson | 0.426 | 0.147 | 0.015 | 0.194 | –0.097 | –0.541a | –0.691b | –0.278 | –0.937c | — |
| P value | .10 | .64 | .95 | .50 | .69 | .02 | .001 | .25 | <.001 | — |
| | | | | | | | | | | |
| Pearson | –0.567a | –0.260 | –0.261 | –0.403 | 0.161 | 0.869c | 0.923c | 0.260 | 0.575b | –0.510a |
| P value | .03 | .66 | .98 | .30 | .51 | <.001 | <.001 | .28 | .01 | .03 |
aP<.05.
bP<.01.
cP<.001.