| Literature DB >> 31578348 |
David Chandran1, Deborah Ahn Robbins2, Chin-Kuo Chang3, Hitesh Shetty4, Jyoti Sanyal4, Johnny Downs2,4, Marcella Fok2, Michael Ball2, Richard Jackson2, Robert Stewart2,4, Hannah Cohen2, Jentien M Vermeulen5, Frederike Schirmbeck5,6, Lieuwe de Haan5,6, Richard Hayes2.
Abstract
Obsessive and Compulsive Symptoms (OCS) or Obsessive Compulsive Disorder (OCD) in the context of schizophrenia or related disorders are of clinical importance as these are associated with a range of adverse outcomes. Natural Language Processing (NLP) applied to Electronic Health Records (EHRs) presents an opportunity to create large datasets to facilitate research in this area. This is a challenging endeavour however, because of the wide range of ways in which these symptoms are recorded, and the overlap of terms used to describe OCS with those used to describe other conditions. We developed an NLP algorithm to extract OCS information from a large mental healthcare EHR data resource at the South London and Maudsley NHS Foundation Trust using its Clinical Record Interactive Search (CRIS) facility. We extracted documents from individuals who had received a diagnosis of schizophrenia, schizoaffective disorder, or bipolar disorder. These text documents, annotated by human coders, were used for developing and refining the NLP algorithm (600 documents) with an additional set reserved for final validation (300 documents). The developed NLP algorithm utilized a rules-based approach to identify each of symptoms associated with OCS, and then combined them to determine the overall number of instances of OCS. After its implementation, the algorithm was shown to identify OCS with a precision and recall (with 95% confidence intervals) of 0.77 (0.65-0.86) and 0.67 (0.55-0.77) respectively. The development of this application demonstrated the potential to extract complex symptomatic data from mental healthcare EHRs using NLP to facilitate further analyses of these clinical symptoms and their relevance for prognosis and intervention response.Entities:
Mesh:
Year: 2019 PMID: 31578348 PMCID: PMC6775052 DOI: 10.1038/s41598-019-49165-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Key modifier words used in the natural language processing application for OCS.
| OCS Keywords | YBOCS Keywords | Patient Insight Keywords |
|---|---|---|
| Obses* (Includes variations such as ‘obsessive’ and ‘obsessional’) | Clean* (Includes variations such as ‘cleaned’ or ‘cleanliness’) | Distres* (Includes variations such as distressed or distressing) |
| Compul* Includes varations such as Compulsive or compulsively, but specifically excluding “compulsory”. | Wash* (Includes variations such as washing or washed) | Unwanted |
| OCD* (Includes variations such as OCD and O.C.D) | Check* (Includes variation such as checking and checked) | Repugnant |
| Hoard* (Includes varations such as Hoarding and Hoarded) | Repeat* (Includes varations such as repeatedly or repetitive) | Repulsive |
| Ritual* (Includes variations such as ‘ritualistic’ and ‘ritually’) | Count* (Includes variations such as counted or counting) | Egodystonic |
| Order* (Includes variations such as ordered or ordering) | Intrusive | |
| Counting | Intruding | |
| Rearrange*(Includes variations such as rearranging or rearranged) | Unable to stop |
List of terms used to identify candidates for exclusion.
| Form | Negation | Other Experiencer | Self-Description | Hedge |
|---|---|---|---|---|
| c - obsessive compulsive | None | Mother/Father | Self-Described | Seem(s) |
| hoarded materials blocking passages | deny* (includes variations such as denied and denying) | Sister/Brother | He/She describe(s/d) | Possible* (Including variations such as possibility/and possibly |
| obsessions and compulsions. none | Nil | Parent | Described Him/Herself | Apparent(ly) |
| Obsessive Compulsive Index (including variations such as o.c.i, oci) | no(t) obses* (includes variations such as obsessed, obsessions and obsessional) | Son/Daughter | Say(s) that | Sound(s) like |
| than (an) obses* (includes variations such as obsessed, obsessions and obsessional) | Sibling | told me | ||
| No History | Family | |||
| No Evidence | Boy/Girlfriend | |||
| Partner | ||||
| Husband/Wife | ||||
| qqqqq (a pseudonym for a family member or carer) |
Performance of individual components of the OCS algorithm in the validation set (300 documents) and the performance overall for detecting any OCS (including OCD) across all strings with Precision (positive predictive value) and recall (sensitivity) provided.
| Symptom | Precision | Recall |
|---|---|---|
| Obsessions | 0.73 | 0.5 |
| Compulsions | 0.63 | 0.83 |
| OCD | 1 | 0.85 |
| Hoard | 0.73 | 0.81 |
| Ritual | 1 | 0.33 |
| Any OCS (including OCD) | 0.77 | 0.67 |
Manual annotation rules for OCS.
|
| Text makes no mention of OCS |
| Text states that patient does not have OCS | |
| Text states that patient has either compulsions or obsessions, not both, and there is no information | |
| about any of the following: | |
| • Patient Distress | |
| • Obsessive or Compulsive symptoms described as egodystonic | |
| • Inability to stop Obsesions or Compulsions | |
| • Description of specific compulsions or specific obsessions | |
| • Patient Insight | |
| Text states that non-clinician observers (patient or family/friends) believe patient has obsessions | |
| or compulsions without describing YBOCS symptoms | |
| Text includes hedge words -i.e., possibly, apparently, seems -that specifically refer to OCS keywords | |
| Text includes risky, risk-taking, or self-harm behaviours | |
| Text includes romantic or weight-related (food related) words that modify OCS Keywords | |
|
| Text states that patient has OCD features/OCD Symptoms |
| Text states that patient has OCS | |
| Text includes hoarding, which was considered part of OCS, regardless of presence or absence of specific | |
| examples | |
| Text states that patient has at least 2 of the OCS keywords | |
| Text states that patient has either obsessive or compulsive or rituals or YBOCS and one of the following: | |
| • Obsessions or Compulsions are described as egodystonic | |
| • Intrusive, cause patient distress or excessive worrying/anxiety | |
| • Patient feels unable to stop obsessions or compulsions | |
| • Patient recognizes symptoms are irrational or senseless | |
| • Clinician provides specific YBOCS symptoms | |
| Text reports that patient has been diagnosed with OCD by clinician |