| Literature DB >> 33329090 |
André Bittar1, Sumithra Velupillai1, Johnny Downs1,2, Rosemary Sedgwick1,2, Rina Dutta1,2.
Abstract
Suicide is a serious public health issue worldwide, yet current clinical methods for assessing a person's risk of taking their own life remain unreliable and new methods for assessing suicide risk are being explored. The widespread adoption of electronic health records (EHRs) has opened up new possibilities for epidemiological studies of suicide and related behaviour amongst those receiving healthcare. These types of records capture valuable information entered by healthcare practitioners at the point of care. However, much recent work has relied heavily on the structured data of EHRs, whilst much of the important information about a patient's care pathway is recorded in the unstructured text of clinical notes. Accessing and structuring text data for use in clinical research, and particularly for suicide and self-harm research, is a significant challenge that is increasingly being addressed using methods from the fields of natural language processing (NLP) and machine learning (ML). In this review, we provide an overview of the range of suicide-related studies that have been carried out using the Clinical Records Interactive Search (CRIS): a database for epidemiological and clinical research that contains de-identified EHRs from the South London and Maudsley NHS Foundation Trust. We highlight the variety of clinical research questions, cohorts and techniques that have been explored for suicide and related behaviour research using CRIS, including the development of NLP and ML approaches. We demonstrate how EHR data provides comprehensive material to study prevalence of suicide and self-harm in clinical populations. Structured data alone is insufficient and NLP methods are needed to more accurately identify relevant information from EHR data. We also show how the text in clinical notes provide signals for ML approaches to suicide risk assessment. We envision increased progress in the decades to come, particularly in externally validating findings across multiple sites and countries, both in terms of clinical evidence and in terms of NLP and machine learning method transferability.Entities:
Keywords: electronic health records; machine learning; natural language processing; self-injurious behaviour; suicide attempted; suicide completed
Year: 2020 PMID: 33329090 PMCID: PMC7729078 DOI: 10.3389/fpsyt.2020.553463
Source DB: PubMed Journal: Front Psychiatry ISSN: 1664-0640 Impact factor: 4.157
Summarised characteristics of clinical cohorts created using CRIS for the study of suicide and related behaviour.
| Polling et al. ( | Adults attending ED | 7,444 | 10,688 ED attendances | N/A | 01/04/2009–31/12/2011 | ICD-10 codes X60-X84, presence of keywords related to self-harm, suicide attempts and suicidality |
| Bogdanowicz et al. ( | Patients with opioid use disorder | 5,335 | N/A | 15–73 years Mean (SD) = 37.6 (9.07) years | 01/04/2008–31/03/2014 | ICD-10 codes X409-X450, Y120, Y125, F119 |
| Lopez-Morinigo et al. ( | Patients with schizophrenia spectrum disorder | 426 (71 cases, 355 controls) | N/A | Mean (SD) = 44.9 (18.0) years | 01/01/2007–31/12/2013 | ICD-10 codes X64, X70, X71, X78, X80, X81, X84, Y10-34 |
| Lopez-Morinigo et al. ( | Patients accessing secondary mental healthcare | 13,758 | N/A | Mean (SD) = 41.3 (12.2) for suicide, 40.6 (11.5) for no suicide | 01/01/2007–01/04/2015 | ICD-10 codes X64, X70, X71, X78, X80, X81, X84, Y10-34 |
| Roberts et al. ( | Individuals with chronic fatigue syndrome | 2,147 | N/A | Mean = 39.1 years | 01/01/2007–31/12/2013 | ICD-10 codes X60-X84 |
| Taylor et al. ( | Perinatal women with SMI | 420 | N/A | Mean (SD) = 31.9 (6.2) years | 01/01/2007–31/12/2011 | Presence of keywords [from ( |
| Downs et al. ( | Children and adolescents with ASD | 1,906 | N/A | 14–18 years | 01/01/2008–31/12/2013 | NLP, manual classification of suicidality-related expressions |
| Velupillai et al. ( | Adolescents attending CAMHS | 23,455 | N/A | 11–17 years | 01/04/2009–31/03/2016 | Manual annotation of suicidality-related expressions, NLP |
| Bittar et al. ( | Patients accessing secondary mental healthcare | 17,640 (2,913 cases, 14,727 controls) | 21,175 admissions (4,235, cases, 16,940 controls) | Mean (SD) = 33.7 (15.6) years | 02/04/2006–31/03/2017 | X6 |
Due to indeterminacy of intent, suicide by overdose and fatal drug poisonings are grouped together.
indicates all codes that begin with the given sequence.