| Literature DB >> 33039266 |
Varsha D Badal1, Sarah A Graham1, Colin A Depp2, Kaoru Shinkawa3, Yasunori Yamada3, Lawrence A Palinkas4, Ho-Cheol Kim5, Dilip V Jeste6, Ellen E Lee7.
Abstract
OBJECTIVE: The growing pandemic of loneliness has great relevance to aging populations, though assessments are limited by self-report approaches. This paper explores the use of artificial intelligence (AI) technology to evaluate interviews on loneliness, notably, employing natural language processing (NLP) to quantify sentiment and features that indicate loneliness in transcribed speech text of older adults.Entities:
Keywords: Artificial Intelligence; gender; social isolation
Year: 2020 PMID: 33039266 PMCID: PMC7486862 DOI: 10.1016/j.jagp.2020.09.009
Source DB: PubMed Journal: Am J Geriatr Psychiatry ISSN: 1064-7481 Impact factor: 4.105
FIGURE 1Processing pipeline for the qualitative interview data. API: application programming interface; NLU: natural language understanding; Q1: Question 1 (“Do you ever feel lonely, and if so, how often?”); Q2: Question 2 (“What does loneliness feel like to you? What is your general mood during that time?”); Q3: Question 3 (“Why do you think others may feel lonely?”); TF-IDF: term frequency – inverse document frequency.
Sociodemographic and Clinical Data of the Interviewees by Sex
| Women | Men | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| N | Mean | SD | N | Mean | SD | t or X2 | df | p | Cohen's d | |
| Age at Visit (years) | 51 | 81.6 | 7.1 | 29 | 85.5 | 5.7 | −2.51 | 78 | 0.01 | −0.85 |
| Education (years) | 51 | 15.4 | 2.4 | 29 | 16.3 | 2.1 | −1.76 | 78 | 0.08 | −0.59 |
| Race (% Caucasian) | 90.2 | 93.1 | 0.20 | 1 | 0.66 | |||||
| Marital Status (% not single) | 37.3 | 51.7 | 1.58 | 1 | 0.21 | |||||
| Qualitative Lonely (% yes) | 52.9 | 31.0 | 3.58 | 1 | 0.06 | |||||
| Quantitatively Lonely (% yes) | 33.3 | 44.8 | 1.04 | 1 | 0.31 | |||||
| UCLA-3 Score | 51 | 36.5 | 9.4 | 29 | 38.7 | 11.2 | −0.92 | 78 | 0.36 | −0.30 |
| Emotional Support (ESS-E) | 51 | 2.8 | 0.4 | 29 | 2.5 | 0.5 | 2.15 | 78 | 0.04 | 0.69 |
| Instrumental Support (ESS-I) | 51 | 1.9 | 0.8 | 29 | 1.8 | 0.8 | 0.88 | 78 | 0.38 | 0.29 |
| Negative social interactions (ESS-NI) | 51 | 0.7 | 0.8 | 29 | 0.8 | 0.7 | −1.06 | 78 | 0.29 | −0.35 |
| Anxiety (BSIAS) | 51 | 1.6 | 2.6 | 28 | 1.4 | 1.5 | 0.33 | 77 | 0.75 | 0.12 |
| Depression (PHQ-9) | 48 | 2.8 | 3.6 | 27 | 3.0 | 3.6 | −0.31 | 73 | 0.76 | −0.11 |
BSIAS: Brief Symptom Inventory Anxiety Scale; ESS-E: Emotional Support Scale – Emotional Support score; ESS-I: Emotional Support Scale – Instrumental Support score; ESS-NI: Emotional Support Scale – Negative Interaction Score; PHQ-9: Patient Health Questionnaire 9-item; UCLA-3: UCLA Loneliness Scale (Version 3).
FIGURE 2Distribution of length of response to Question 1 by quantitatively assessed loneliness. Q1: Question 1 (“Do you ever feel lonely and if so, how often?”).
FIGURE 3Emotional composition of response to Question 1 (“Do you ever feel lonely and if so, how often?”) by (A) Qualitative loneliness and (B) Quantitative assessment of loneliness (UCLA-3 Score). Dashed lines in the middle of distribution indicate median (second quartile) and dotted lines indicates first and third quartiles in the distribution.
FIGURE 4Distribution of emotions (sadness, joy, fear, disgust, anger) in response to Question 1 (“Do you ever feel lonely and if so, how often?”) by sex. Dashed lines in the middle of distribution indicate median (second quartile) and dotted lines indicate first and third quartiles in the distribution.
Performance of Machine Learning (ML) Models (80–20 Split) in Predicting Qualitative Loneliness (Lonely versus Not Lonely)
| ML Model | AUC | F1 | Precision | Recall |
|---|---|---|---|---|
| Stack | 0.91 | 0.87 | 0.90 | 0.87 |
| SVM linear | 0.95 | 0.87 | 0.90 | 0.87 |
| ANN tanh | 0.93 | 0.87 | 0.90 | 0.87 |
| ANN ReLu | 0.93 | 0.87 | 0.90 | 0.87 |
| ANN Logistic | 0.95 | 0.87 | 0.90 | 0.87 |
| SVM RBF | 0.91 | 0.81 | 0.82 | 0.81 |
| SVM Polynomial | 0.88 | 0.81 | 0.87 | 0.81 |
| Random Forest | 0.91 | 0.81 | 0.87 | 0.81 |
| AdaBoost | 0.80 | 0.74 | 0.85 | 0.75 |
| Tree | 0.71 | 0.69 | 0.74 | 0.68 |
Notes: Qualitative loneliness was manually determined based on responses to Question 1. Input features included: five emotions (joy, fear, anger, disgust, sadness), Question 1 TF-IDF score, Question 2 TF-IDF score, Question 3 TF-IDF score, and sentiment for Question 1. Results depicted reflect the best of 10 runs. Stack includes (SVM Polynomial, KNN, Tree, AdaBoost, ANN ReLu, random forest). AUC: area under curve (performance measure); kNN: K-nearest neighbor (algorithm), k = 9; ReLu: rectified linear unit (activation function); RBF: radial basis function (kernel function); SVM: support vector machine (algorithm); tanh: hyperbolic tangent (activation function); TF-IDF: term frequency – inverse document frequency. Bold values indicate the best performing model.
The performance measures shown are average over classes and computed as documented Orange.
Performance of Machine Learning (ML) Models (80–20 Split) in Predicting Quantitative Loneliness (Lonely Versus Not Lonely)
| ML Model | AUC | F1 | Precision | Recall |
|---|---|---|---|---|
| Tree | 0.69 | 0.68 | 0.69 | 0.68 |
| Random Forest | 0.59 | 0.62 | 0.62 | 0.62 |
| AdaBoost | 0.61 | 0.62 | 0.62 | 0.62 |
| kNN | 0.60 | 0.55 | 0.55 | 0.56 |
| Stack | 0.58 | 0.53 | 0.54 | 0.56 |
| ANN Logistic | 0.60 | 0.53 | 0.54 | 0.56 |
| SVM RBF | 0.65 | 0.53 | 0.77 | 0.62 |
| SVM Polynomial | 0.69 | 0.53 | 0.77 | 0.62 |
| SVM Linear | 0.53 | 0.53 | 0.77 | 0.62 |
| ANN ReLu | 0.65 | 0.44 | 0.44 | 0.50 |
Notes: Quantitative loneliness was determined by total score on the UCLA Loneliness Scale (version 3): ≤40 = No/Low Loneliness and >40 as Lonely. Input features included: five emotions (joy, fear, anger, disgust, sadness), Question 1 TF-IDF score, Question 2 TF-IDF score, Question 3 TF-IDF score, and sentiment for Question 1. Results depicted reflect the best of 10 runs. Stack includes (SVM polynomial, KNN, Tree, AdaBoost, ANN ReLu, random forest). AUC: area under curve (performance measure); kNN: K-nearest neighbour (algorithm); ReLu: rectified linear unit (activation function); RBF: radial basis function (kernel function); SVM: support vector machine (algorithm); tanh: hyperbolic tangent (activation function); TF-IDF: term frequency – inverse document frequency. Bold values indicate the best performing model.
The performance measures shown are average over classes and computed as documented Orange.