| Literature DB >> 31480718 |
Joana M Barros1,2, Ruth Melia3, Kady Francis4, John Bogue5, Mary O'Sullivan6, Karen Young7, Rebecca A Bernert8, Dietrich Rebholz-Schuhmann9, Jim Duggan7.
Abstract
Annual suicide figures are critical in identifying trends and guiding research, yet challenges arising from significant lags in reporting can delay and complicate real-time interventions. In this paper, we utilized Google Trends search volumes for behavioral forecasting of national suicide rates in Ireland between 2004 and 2015. Official suicide rates are recorded by the Central Statistics Office in Ireland. While similar investigations using Google trends data have been carried out in other jurisdictions (e.g., United Kingdom, United Stated of America), such research had not yet been completed in Ireland. We compiled a collection of suicide- and depression-related search terms suggested by Google Trends and manually sourced from the literature. Monthly search rate terms at different lags were compared with suicide occurrences to determine the degree of correlation. Following two approaches based on vector autoregression and neural network autoregression, we achieved mean absolute error values between 4.14 and 9.61 when incorporating search query data, with the highest performance for the neural network approach. The application of this process to United Kingdom suicide and search query data showed similar results, supporting the benefit of Google Trends, neural network approach, and the applied search terms to forecast suicide risk increase. Overall, the combination of societal data and online behavior provide a good indication of societal risks; building on past research, our improvements led to robust models integrating search query and unemployment data for suicide risk forecasting in Ireland.Entities:
Keywords: Google Trends; Ireland; autoregression; forecasting; neural networks; suicide
Mesh:
Year: 2019 PMID: 31480718 PMCID: PMC6747463 DOI: 10.3390/ijerph16173201
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Search queries gathered. The first two columns indicate search queries related to “suicide” and “depression”, suggested from Google Trends. The third column represents the terms cited in Tran et al. Additional terms are present in the last column.
| Suicide as Root Term | Depression as Root Term | Terms from | Additional Terms |
|---|---|---|---|
| suicidal | anxiety | commit suicide | suicide |
| suicide methods | signs of depression | i want to die | depression |
| how to commit suicide | symptoms of depression | suicidality | baby blues |
| postnatal depression | suicide attempt | feeling down | |
| depression and anxiety | suicide forum | ||
| what is depression | suicidal ideation | ||
| depressed | suicidal thoughts | ||
| post natal depression | suicide hotline | ||
| clinical depression | how to hang yourself | ||
| manic depression | how to kill yourself | ||
| how to help depression | |||
| severe depression | |||
| postpartum depression | |||
| how to deal with depression |
Figure 1Irish official suicide statistics and Irish unemployment records. The remaining y-axis scales represent the number of occurrences. The x-axis represents the date in respect of each data point.
Normality analysis results for official suicides figures. The results suggest that the data follows a normal distribution. The significance threshold used was 0.05.
| Minimum | Maximum | Mean | Skewness | Excess Kurtosis | Jarque Bera | |
|---|---|---|---|---|---|---|
| 19 | 64 | 41 | 0.02 | −0.27 | 0.46 | 0.8 |
Figure 2Correlation coefficients for Irish suicide data, Google search queries, and unemployment at different lags. The correlation coefficient value-color correspondence is represented on the bar on the right. Queries with a correlation not statistically significant are omitted; these include “suicidal”, “how to kill yourself”, “painless suicide”, “suicide forum”, “how to hang yourself”, “signs of depression”, “severe depression”, “post natal depression”.
Figure 3Correlation coefficients for Irish suicide data, Google search queries, and unemployment. The correlation coefficient between the 34 features and the official suicide figures are represented using a lag of 24 months. The correlation coefficient value-color correspondence is represented on the bar on the right. Cells without a color are not statistically significant.
Features utilized in the reduced and GU models. The models also include historical suicide data.
| Model | Features |
|---|---|
| Reduced | Historical suicide occurrences data , “suicide”, “depression”, “anxiety”, “suicide methods”, “suicidal”, “how to commit suicide”, and “postnatal depression”. |
| Google + Unemployment | Historical suicide occurrences data, “suicide”, “depression”, “suicidal”, “suicide methods”, “how to commit suicide”, “anxiety”, “postnatal depression”, “signs of depression”, “symptoms of depression”, “depression and anxiety”, “depressed”, “post natal depression”, “manic depression”, “how to help depression”, “severe depression”, “how to deal with depression”, “baby blues”, “feeling down”, “commit suicide”, “how to kill yourself”, “I want to die”, “suicidality”, “suicide attempt”, “suicide forum”, “suicidal ideation”, “suicidal thoughts”, “suicide hotline”, “how to hang yourself”, “clinical depression”, “what is depression”, and unemployment records. |
Statistical models result for the benchmark and the VAR approach.
| AR Benchmark | Google + | Reduced | ||
|---|---|---|---|---|
| “ | ||||
|
| 10.35 | 9.41 | 6.33 | 9.61 |
| Lag order | 2 | 3 | 24 | 24 |
Statistical models result for the NN approach.
| Benchmark | Google + | Reduced | ||
|---|---|---|---|---|
| “Feeling Down” | “Feeling Down” + Unemployment | |||
|
| 6.87 | 5.08 | 4.14 | 4.23 |
| Lag order | 12 | 12 | 12 | 12 |
Figure 4Model performance by the reduced model from the VAR and NNAR approaches for the year 2015.
Statistical model results for the UK applying the NNAR approach.
| UK Benchmark | UK | UK | |
|---|---|---|---|
|
| 26.41 | 25.14 | 6.01 |
|
| 2 | 2 | 2 |
Figure 5UK model performance in 2014 using the benchmark and Google + Unemployment approach.